Beyond headscarf symbolism


Turkey’s scientists show they no longer want to expend their energy on political confrontation, but 
political volatility is threatening their efforts to work in peace. 


This spring in Istanbul, a hundred or so faculty members and students held a lunch-time
demonstration in one of Koç University’s elegant courtyards. They were protesting against the
administration's failure to protect the jobs of the university’s service workers. It was
peaceful. No one tried to stop them.

This is one side of Turkey. The other side is more visible — the violent response of police to
the Taksim Square protests against unchecked city development a few weeks later, for example.
And the long jail sentences dished out earlier this month to some former university rectors,
dogmatic opponents of their country’s ban on headscarves in public institutions (see Nature
500, 129-130; 2013). The headscarf ban, and the reaction to it, is a telling and useful guide
to how Turkish universities and Turkish science could yet flourish amid such political
volatility.

Koç University is private, so the ban does not apply. Free to choose, only a single head was
covered among the protestors. With reasonable budgets, reasonable student numbers and a
reasonable absence of ideology, Koç University has little to worry about. Its three-year-old
medical faculty has already attracted a European Research Council grant holder.

A few kilometres farther south, the cash-starved, public Bogazici University is required to
apply the headscarf ban, but the occasional defiant student who chooses to cover her head
tends to go unnoticed. As elsewhere in Turkey, most academics and scientists just don’t want
to expend energy on fights about headscarves any more.

That is a U-turn in mentality, and an encouraging one. The headscarf ban has been the symbol
of a decades-long struggle between secularists who see covering the hair as a slippery slope
towards a breakdown of Turkey’s constitutional separation of state and religion, and those who
would like to see a greater presence of Islam in daily life. Scientists tend increasingly to
see this stand-off for what it is — less about religion than about power, and they are worried
more about the ascension of ignorance than of fundamentalism.

Turkey's economic growth and expanding cities have created a pow- 
erful new middle class with strong religious roots. Ten years ago they 
helped to elect Prime Minister Recep Tayyip Erdogan with his ‘mildly
Islamic’ government, and Erdogan supporters now hold key positions
at institutions such as the Turkish Academy and TUBITAK, Turkey's 
research funding agency. Critics say that these individuals are not always 
qualified for the posts, and that some may even have abused their posi- 
tions to take revenge on those who stood in their way in the past. 

TUBITAK has made a series of poor decisions in recent years. Most
notorious was in 2009, when it demoted Cigdem Atakuman, the editor
of its magazine Bilim ve Teknik, after she objected to its censorship
of an article celebrating Charles Darwin. She has since challenged 
TUBITAK’s investigations in three separate court cases, which she 
won. More recently, the organization declined to fund a workshop 
on evolutionary biology, saying that evolution is controversial. Anti- 
evolutionism is a religious stance, and with this argument TUBITAK 
shows a failure to understand science and its processes. 

Against this alarming backdrop, Turkey’s small, mostly foreign- 
trained research community is trying to do science as it would be done 
in the countries where its members trained. They frequently succeed 
in small pockets, as at Koç and Bogazici. But even at Koç, scientists
complain about a lack of critical mass that threatens their long-term
future. The current less-confrontational mood will help them to focus
their energy. But they won't be secure until the government acknowledges
that science, just like a headscarf, is not a political toy.


In addition 


Conflicts of interest and gaps in data 
contaminate US oversight of food additives. 


A chef who crafts a delicacy for sale in the United States can choose from more than 10,000
food additives to garnish the dish. Of these chemicals, 43% are labelled ‘generally recognized
as safe’ (GRAS) and need not be approved by the US Food and Drug Administration (FDA).

The system has weaknesses. A manufacturer is responsible for assessing whether an additive it
has developed is GRAS. Once that is done, the manufacturer is asked — but not required — to
notify the FDA. There are no data to evaluate compliance systematically, but the FDA found
during a 2010 crackdown on caffeinated alcoholic drinks that four out of four manufacturers
queried had not done the required checks.

Even when manufacturers do submit GRAS determinations, there 
are concerns about the quality of the assessment. An ongoing project at 
the Pew Charitable Trusts in Washington DC reveals discomfiting gaps 
in the data. A search of three toxicological databases, including that of 
the FDA, showed that fewer than 38% of GRAS claims were backed 
up by FDA-recommended toxicology studies in animals (T. G. Neltner 
et al. Reprod. Toxicol. http://dx.doi.org/10.1016/j.reprotox.2013.07.023;
2013). The same team has published an analysis of 451 GRAS notifica- 
tions submitted to the FDA. To avoid conflicts of interest, assessments 
should be done by an independent expert panel, but none had been; 
in fact, 22% had been performed by an employee of the manufacturer 
(T. G. Neltner et al. JAMA Intern. Med. http://doi.org/nd5; 2013). 

The FDA told Nature this week that it plans to issue guidance on 
how to fulfil GRAS requirements. One suggestion, proposed by Pew, 
is to take conflict-of-interest policies used to select FDA advisers, 
and apply them to GRAS determinations. The FDA should seize the 
opportunity to protect public health and boost confidence.
The increasing dominance of quantitative research assessment threatens the
subjective values that really matter in academia, says Colin Macilwain.

maintain a healthy scepticism towards league tables and the metrics that underpin them. But
now, officials at institutions that do well in such assessments — partly on merit, and partly
because they use the English language or have other historical advantages — are becoming
beguiled with quantitative measures to rate the performance of academic staff. People who care
about genuine quality in research and teaching need to resist that shift.

Universities evolved as self-governing bodies of academics. Originally, the president or
vice-chancellor had a purely housekeeping role, once described in US parlance as assuring
parking for the staff, sex for the students and sport for the alumni.

But lately — not least in Britain, where schemes such as the Research Assessment Exercise have
come to dominate academic life — power has moved from the departments to the vice-chancellor.
And university leaders, flanked by research managers and associated flunkies, want to use
metrics to shift that balance still further.

Eight leading British universities are now energetically engaged in the joint development of a
formidable computer tool that allows them to compare the performances of their researchers and
departments against rivals, according to grant income, number of patents applied for, or
pretty much any other criteria they choose. The tool is called Snowball
(www.snowballmetrics.com) and the institutions signed up to it include the universities of
Oxford and Cambridge, Imperial College London and University College London.

Like any metrics system, Snowball can, in theory, be used for good or ill. I suspect that in
practice, however, it will end up being used mainly to exercise yet more control over academic
staff, with every aspect of their professional lives tracked on the system.

Although Snowball was developed by people of genuine integrity who want to establish a fuller
understanding of research performance, it shares a fundamental defect with other quantitative
research-assessment tools: it is largely built on sand. It cannot directly measure the quality
of research, never mind teaching, so instead it uses weak surrogates, such as the citation
indices of individuals.

Citation indices — which rank research in terms of the average number of citations for
articles — were robustly challenged earlier this year, when organizations led by the American
Society for Cell Biology signed the San Francisco Declaration On Research Assessment (DORA),
pledging to take a stand against the ever-expanding reach of journal-based metrics. One of
DORA’s best ideas is to ask that citation databanks be available openly, for all researchers
to use. I wish them luck with that. University managers know that information is power — and
they want not just the data, but to dictate how they are manipulated.

A major problem with metrics is the well-charted tendency for people 
to distort their own behaviour to optimize whatever is being measured 
(such as publications in highly cited journals) at the expense of what is 
not (such as careful teaching). Snowball is supposed to get around that 
by measuring many different things at once. Yet it cannot quantify the 
attributes that society values most in a university researcher — original- 
ity of thinking and the ability to nurture students. Which is not the same 
as scoring highly in increasingly ubiquitous student questionnaires. 

Senior scientists have known for a long time that bogus measures of
‘scientific quality’ can threaten the peer-review system that has been 
painstakingly built up, in most leading scientific nations, to distribute 
funds on the basis of merit. 

In the United States in 1993, for example, 
Congress passed the Government Performance 
and Results Act, which compelled federal agen- 
cies to start measuring their results. However, the 
US scientific establishment was strong and self- 
assured at that time, and successfully derailed 
the prospect that agencies such as the National 
Science Foundation (NSF) would start inventing 
numbers to ‘measure’ the work of its grant recipi- 
ents. Instead, the NSF sticks to measuring things 
such as time to grant. 

Nations with weaker scientific communities are 
less well-placed to fend off the march of metrics. 
The hazards are perhaps most immediate in 
places such as Italy, where peer review for grants 
has never fully taken hold, and China, where it 
has rarely even been tried. There is a worrying tendency in developing 
countries, especially, for research agencies to skip the nuanced business 
of orchestrating proper peer review, and to move straight to the crude 
allocation of funds on the basis of measured performance. This bypasses 
quality and, bluntly, invites corruption. 

But I see trouble ahead at the leading universities in the United King- 
dom and the United States, too. Their reputations were built by autono- 
mous academics, working patiently with students. If the name of the 
game becomes strong performance measured in numbers — as the vice- 
chancellors seem to want — it will kill the goose that laid the golden egg. 

Defenders of Snowball say they are baffled that scientists, given what 
they do for a living, remain sceptical of research-performance metrics. 
But science seeks to identify and measure good surrogates, to test falsifi- 
able hypotheses. Seen in that light, quantifiable research assessment does 
not measure up. Nevertheless, the snowball has started rolling down the 
mountain — and it is hard to see how its momentum will be arrested.


Colin Macilwain writes about science policy from Edinburgh, UK. 
e-mail: cfmworldview@gmail.com 
Selections from the scientific literature


Sinking ground poisons wells

A previously unrecognized source of arsenic threatens Vietnam's clean water.

Arsenic contamination generally affects wells at shallow depths, so wells deeper than 150
metres are often deemed safer. However, Steven Gorelick and colleagues at Stanford University
in California suggest how arsenic contamination can occur deep underground. They analysed an
area in the Mekong Delta of Vietnam, where many deep wells are contaminated. Using satellite
measurements and simulations, they found that the land around deep wells has sunk by up to 27
centimetres since 1988 as groundwater has been pumped out. Clay layers adjacent to the pumped
areas have compacted, which could be forcing water containing dissolved arsenic into deep
aquifers. Wells that initially test as arsenic-free may not remain so, the authors say.

Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1300503110 (2013)


Spider silk stalls a somersault

A silk string helps leaping spiders to land fast on their feet.

Jumping spiders, or salticids, can travel more than 20 times their body length in a single
bound, usually with a dragline of silk, thought to function as a safety line, trailing behind
them. Kai-Jung Chi at National Chung Hsing University in Taiwan and her colleagues showed that
the dragline also provides stability, preventing the predatory spiders from pitching too far
back through the air and so leaving them poised for action on landing. Time-lapse images of
the salticid Hasarius adansoni showed that spiders with silk (pictured; left panels)
maintained orientation during jumps: they landed feet-first and were ready to pounce within
about 10 milliseconds. Salticids that did not produce silk (right panels) landed on their
abdomens, slipped or tumbled, sometimes requiring more than 50 milliseconds to regain their
footing.

J. R. Soc. Interface 10, 20130572 (2013)


Sand collisions kick up storms

Sandstorms whipped up by desert winds owe their strength to mid-air collisions between sand
grains.

Marcus Carneiro and Hans Herrmann at the Swiss Federal Institute of Technology in Zurich and
their colleagues used computer simulations to show that without collisions, sand grains whizz
along close to the ground or make small hops. But when they encounter each other in the air, a
few particles are bounced upwards off their neighbours and are accelerated by the stronger
winds that blow higher up. These high-flying ‘saltons’ ultimately crash into the ground,
kicking out sprays of sand that further fuel the storm.

Such collisions can double the amount of material in the air, boosting the quantities that
storms carry as they sculpt desert landscapes.

Phys. Rev. Lett. 111, 058001 (2013)


Climate tracking by smartphone

Scientists can gather climate data from smartphones in their owners’ pockets.

Aart Overeem of Wageningen University, the Netherlands, and his colleagues exploited an
existing Android smartphone app to collect and select around 1.3 million temperature readings
from eight cities around the world, averaging more than 800 readings per city each day. The
app gathered data from phone-battery temperature sensors and uploaded the readings when a data
connection was available. A simple heat-transfer model accounted for the effects of body
warmth and clothing, and allowed the researchers to estimate daily temperatures in urban
environments. The approach could be used to collect data at a finer scale and lower cost than
is currently possible with weather stations, the authors suggest.

Geophys. Res. Lett. http://dx.doi.org/10.1002/grl.50786 (2013)
PHYSIOLOGY


Why sunburn 
stings 


A membrane protein in 

skin cells transmits signals 
that trigger pain and tissue 
damage that put the ‘burn’ into 
sunburn. 

Wolfgang Liedtke of Duke 
University in Durham, North 
Carolina, and his colleagues 
switched off production of 
a cellular ion channel called
TRPV4 in the epidermis of 
genetically engineered mice. 
They exposed the mice to 
ultraviolet (UV) radiation, 
which causes sunburn. 

Compared with UV-exposed 
mice that had normal TRPV4 
production, the engineered 
mice had less damage on 
their hind-paw pads (which 
resemble human skin). Their 
paws were also less sensitive 
to heat or to being poked with 
fine wires — signs that the 
engineered mice experienced 
less pain. Furthermore, three 
human patients with sunburn 
had higher expression of 
TRPV4 in their epidermis than 
three unburned controls. 

Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/pnas.1312933110 (2013)


Malaria vaccine 
protective at last 


A vaccine made of a purified,
weakened form of a malaria
parasite gave 100% protection
in a small clinical trial.

Of six people who were 
given five intravenous doses 
of the live-attenuated vaccine 
and then bitten by infectious 
mosquitoes, none developed 
malaria. Five out of six 
unvaccinated people did get the 
disease, as did three out of nine 
patients who received lower 
doses of vaccine. These levels of 
protection, reported by Robert 
Seder at the National Institutes 
of Health in Bethesda, 
Maryland, and his colleagues, 
surpass any previously 
achieved for a malaria vaccine. 
The vaccine will now be tested 
in larger field trials in Africa, 


which will determine how long 
protection lasts and whether it 
works against other strains of 
the malaria parasite. 

Science http://dx.doi.org/10.1126/science.1241800
(2013) 

For a longer story on this research, 
see go.nature.com/maeStu 
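
The protection figures reported above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the common definition of protective efficacy as 1 minus the ratio of the attack rate in a vaccinated group to that in the unvaccinated group, which the item itself does not spell out, and the function and variable names are hypothetical. With groups this small, the resulting percentages carry wide uncertainty.

```python
from fractions import Fraction


def attack_rate(cases, participants):
    """Fraction of a group that developed malaria after exposure."""
    return Fraction(cases, participants)


def protective_efficacy(vaccinated_rate, control_rate):
    """Assumed definition: 1 - (attack rate vaccinated / attack rate unvaccinated)."""
    return 1 - vaccinated_rate / control_rate


# Counts as reported in the item above.
control = attack_rate(5, 6)      # unvaccinated: five of six fell ill
full_dose = attack_rate(0, 6)    # five intravenous doses: none of six fell ill
lower_dose = attack_rate(3, 9)   # lower doses: three of nine fell ill

full_dose_efficacy = protective_efficacy(full_dose, control)
lower_dose_efficacy = protective_efficacy(lower_dose, control)

print(f"Five-dose group: {float(full_dose_efficacy):.0%}")
print(f"Lower-dose group: {float(lower_dose_efficacy):.0%}")
```

Under that assumed definition, the five-dose group works out to the 100% protection quoted in the item, and the lower-dose group to three-fifths of that.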


Toothy patch in 
fossil fish 


Jawless fish that swam the seas 
some 400 million years ago 
may have relied on the same 
mechanism to repair bone 
damage as modern mammals 
do to mend their teeth. 

A team led by Zerina 
Johanson at the Natural 
History Museum in London 
analysed fossils of the large 
fish Psammolepis from rock 
formations in Estonia and 
Latvia. Psammolepis are 
covered by bony shields 
ornamented with a thin layer 
of dentine, the same tissue that 
makes up teeth. One specimen 
had a damaged armour plate, 
possibly from a predator 
attack. The researchers were 
surprised to find that the bony 
plate was plugged not with 
bone, but with dentine. 

Specialized stem cells speed 
the regrowth of dentine in 
extant mammals such as 
humans and mice, and the 
researchers suggest that a 
similar mechanism existed in 
this toothless fish to repair its 
bony armour. 

Biol. Lett. http://dx.doi.org/10.1098/rsbl.2013.0144 (2013)


Genome mix-ups monitored

Cancer cells often contain translocation mutations, in which parts of the genome are swapped
between chromosomes. For the first time, scientists have been able to watch these accidents as
they happen in living cells.

Researchers led by Tom Misteli at the National Cancer Institute in Bethesda, Maryland, coupled
ultra-high-throughput techniques for time-lapse imaging with a system for ‘snapping’ DNA
strands in chromosomes, and watched what happened to DNA fragments in cultured cells. On rare
occasions, the breaks occurred in separate chromosomes and the broken ends attached to the
wrong partners. These translocations formed within hours of a break and were guided by the
cell’s repair machinery. The work will allow further study of how and when chromosomal mix-ups
occur.

Science 341, 660-664 (2013)


The most viewed papers in science

Young female birds get the worm

Birds’ ability to learn from watching their peers varies with age, sex and social rank. In a
study led by Lucy Aplin at the University of Oxford, UK, wild-caught blue tits (Cyanistes
caeruleus; famous for learning to pierce foil caps on milk bottles delivered to British homes
to get the cream) were placed in an aviary with a covered tray containing waxworms, the blue
tit’s favourite snack. Eight experimental groups totalling 56 birds could observe
‘demonstrator’ birds that had been trained to get worms by either flipping up or piercing the
covers. About half of these birds learned to get the worms, but when 32 other birds were given
the tray in the absence of demonstrators, none of them learned the skill. Juvenile females
were almost twice as likely to learn the task as other blue tits, and dominant males less
likely.

Anim. Behav. 85, 1225-1232 (2013)


REPRODUCTION 


Sneaky breeders 
make sons 


Altruistic worker bees usually 
abstain from reproduction 
— unless they have a shot at 
making a royal baby. 

Michael Holmes at the 
University of Sydney in 
Australia and his colleagues 
monitored the fertility and
breeding habits of seven 
colonies of Western honeybees 
(Apis mellifera). It was 
thought that workers (who 
are all female) refrain from 
breeding and instead raise 
the queen’s brood, which is 
better for the colony’s overall 
success. But the researchers 
found that overall, 4.2% of 
males (drones) are produced 
by workers, a level 40 times 
greater than that typically 
reported. Furthermore, just 
before virgin queens set out 
to establish their own hives, 
a worker baby boom occurs
and the number of worker- 
produced drones peaks at 
more than 6%. 

The authors speculate that 
workers may risk breaking rank 
for the chance of producing a 
new queen’s future mate.

Mol. Ecol. 22, 4298-4306 
(2013) 
Coronavirus clues


Scientists have an early lead in 
the search for animal sources 
of the Middle East respiratory 
syndrome coronavirus 
(MERS-CoV), which has 
killed 46 people since last 
September. The virus is 
thought to have originated in 
bats, but could have spread 
to humans through one or 
more intermediate hosts. 
Researchers reported on 

9 August that of 50 Omani 
camels sampled, all showed 
evidence of previous 
infection with MERS-CoV 
or a closely related virus
(C. B. E. M. Reusken et al.
Lancet Infect. Dis. http://doi.org/nfb;
2013). Further work
is needed to verify whether 
camels could be a source of 
human infections. 


Stormy Atlantic 


The current Atlantic hurricane 
season, which began on 

1 June, could be unusually 
active, said the US National 
Oceanic and Atmospheric 
Administration in an updated 
forecast on 8 August. There 
have been four named storms 
so far this year. The agency 
calculates a 70% chance 

that a total of 13-19 named 
storms, including 3-5 major 
hurricanes, will develop 
before the Atlantic season 
ends on 30 November. These 
projections would exceed the 
30-year seasonal average of 
12 named storms, including 3 
major hurricanes. 


GM rice row
Protesters uprooted a field of genetically modified (GM) ‘golden rice’ at a Philippines
Department of Agriculture compound in Camarines Sur on 8 August, citing concerns over future
marketing and potential health effects. The crop is engineered to contain β-carotene and helps
to mitigate vitamin A deficiency — which causes malnutrition and affects 1.7 million children
in the Philippines. The non-profit International Rice Research Institute, which was helping to
conduct safety trials on the rice, vowed to continue its experiments.


Astronomers image pink exoplanet

A magenta exoplanet 17.5 parsecs from Earth is the lowest-mass planet that has ever been
directly imaged orbiting a Sun-like star outside the Solar System, NASA announced last week.
The Subaru Telescope on Mauna Kea, Hawaii, took pictures of the exoplanet GJ 504b at
near-infrared wavelengths with the help of adaptive optics. GJ 504b is four times more massive
than Jupiter and, with a surface temperature of 237 °C, still glows pink from its fiery birth
160 million years ago. The rosy planet (pictured in an artist's impression) orbits the star
GJ 504, which can be seen with the naked eye, in the constellation Virgo. With an orbiting
radius 43.5 times Earth’s distance from the Sun, GJ 504b challenges current theories of how
far away from stars large planets form. The findings will be published in The Astrophysical
Journal.

S. WIESSINGER/NASA GODDARD SPACE FLIGHT CENTER


Landsat woes
The future of Landsat, the world’s longest-running series of Earth-observing satellites, is in
jeopardy, according to a report released on 8 August by the US National Research Council.
Current methods of management and funding cannot sustain the land-imaging programme over the
long term, the report warns. NASA launched the US$855-million Landsat 8 probe in February (see
Nature 494, 13-14; 2013), but the space agency has not yet outlined clear mission objectives
or secured sufficient funding for a successor. Landsat satellites have monitored Earth’s
surface continuously since 1972.


H7N9 virus persists
China reported on 11 August its first new case of the H7N9 avian influenza virus in three
weeks: a 51-year-old woman from Huizhou in Guangdong province, who fell ill on 27 July. Since
the start of June, only three cases have been confirmed in China; 132 cases were confirmed
from March to May. There have been 44 deaths, but no evidence of sustained human-to-human
transmission, says the World Health Organization.


Brain-implant trial
A deep-brain stimulation (DBS) device that not only delivers electrical pulses, but also
records brain activity simultaneously, has been implanted in a person for the first time.
Until now, there has been a lack of data on how the brain responds to DBS, which is used to
treat motor disorders such as Parkinson's disease, and is being tested for some psychiatric
conditions. On 7 August, the device maker — Medtronic in Minneapolis, Minnesota — announced
the start of clinical trials for the system, with the first implantation in a person with
Parkinson’s in Germany.


Genetics lawsuit 
Ambry Genetics in Aliso 
Viejo, California, has 
countersued its competitor 
Myriad Genetics. Myriad, a 
medical diagnostics company 
in Salt Lake City, Utah, sued 
Ambry in July for infringing 
patents that Myriad holds on 
tests for cancer-associated 
mutations in the BRCA1 and 
BRCA2 genes. Last week, 
Ambry replied that Myriad’s 
patent claims were invalid in 
light of a June ruling by the 
US Supreme Court that 
human genes cannot 

be patented (see Nature 

498, 281-282; 2013), and 
countersued Myriad for 
suppressing competition to 
maintain a monopoly on the 
tests. 


TREND WATCH

Increased emissions of carbon dioxide, nitrous oxide and methane drove atmospheric greenhouse
gases to their highest recorded levels in 2012, according to the US National Oceanic and
Atmospheric Administration (NOAA). CO₂ now accounts for about 80% of the increase in
global-warming potential since 1990, the agency says. In May, atmospheric concentrations of
the gas surpassed 400 parts per million at NOAA’s Mauna Loa Observatory in Hawaii (see
Nature 497, 13-14; 2013).


PEOPLE

NASA deputy out

NASA deputy administrator Lori Garver (pictured) is resigning after four years, the agency
announced on 6 August. As second in command to NASA chief Charles Bolden, Garver had a major
role in developing initiatives such as commercial space transportation and a mission to
capture an asteroid. She has also advised US President Barack Obama on space policy during his
campaign and presidency. Garver will become general manager at the Air Line Pilots
Association, based in Washington DC.


Misconduct finding 
A dermatology researcher 

at Case Western Reserve 
University in Cleveland, 

Ohio, has been sanctioned 

for misconduct by the US 
Office of Research Integrity 
(ORI). Pratima Karnik 
admitted submitting a grant 
application to the US National 
Institutes of Health that 
contained text plagiarized 
from a grant application she 
had reviewed, as well as from 
eight research articles and a
patent application. The ORI, 
in Rockville, Maryland, noted 
on 6 August that Karnik has 
agreed to have her research 


supervised, and to exclude 
herself from any peer- 
reviewing or advising for US 
public-health services for two 
years. 


US biofuel quotas 


In a long-awaited decision,
on 6 August the US 
Environmental Protection 
Agency (EPA) announced 
the nation’s biofuel quotas 
for 2013. Biofuels — mostly 
those based on ethanol from 
maize (corn) — must account 
for at least 9.74% of the total 
US fuel supply this year, up 
from 9.23% in 2012. However, 
the percentage of cellulosic 
biofuels — those made from 
inedible plant fibres such as 
wood — was reduced from 
last year’s target of 0.006% 

to 0.004% after a federal 
court ruled in January that 
the fledgling industry would 
not be able to meet the EPA’s 
demands. 


Food safety concern
Additives that help to colour, flavour and preserve food are often officially deemed safe by
experts with financial ties to the manufacturers, according to data published on 7 August
(T. G. Neltner et al. JAMA Intern. Med. http://doi.org/nd5; 2013). The study, led by
researchers at the Pew Charitable Trusts in Washington DC, found that employees or hired
consultants of food companies made 35% of the 451 determinations classifying food additives as
safe in the United States between 1997 and 2012. The authors have called on the US Food and
Drug Administration to bar the practice. See go.nature.com/jqccur and page 253 for more.


RECORD MEASUREMENT OF GREENHOUSE GASES

The US National Oceanic and Atmospheric Administration’s
Annual Greenhouse Gas Index (AGGI*) rose by 1.5% last year.
*The AGGI, a measure of global-warming potential, is gauged relative to baseline levels in 1990.


SEVEN DAYS | THIS WEEK

20 AUGUST
A US presidential bioethics commission discusses the ethical implications of neuroscience and
its applications at a public meeting in Philadelphia, Pennsylvania (see page 261).
go.nature.com/zrthim

22 AUGUST
US federal agencies reach the deadline to file their proposals for increasing public access to
federally funded research, set by the Office of Science and Technology Policy in Washington DC
(see Nature 494, 414-415; 2013).


Space fence down 
Citing the US federal budget 
cuts known as sequestration, 
the US Air Force Space 
Command announced on 

12 August that it will shut 
down a key component of a
network that monitors orbital 
debris and space objects. The 
Air Force Space Surveillance 
System, dubbed the space 
fence, will close by 1 October. 
Two of the system’s nine radar 
stations were deactivated in 
April. Losing the rest will make 
it harder to track break-ups of 
space junk, which can endanger 
satellites, says Brian Weeden, 
technical adviser at the 

Secure World Foundation in 
Washington DC. 


Optical stimulation of light-responsive neurons in engineered mice can be used to create false memories.


brain project puts focus on ethics


Unsettling research advances bring neuroethics to the fore. 


BY HELEN SHEN 


The false mouse memories made the
ethicists uneasy. By stimulating certain
neurons in the hippocampus, Susumu
Tonegawa and his colleagues caused mice
to recall receiving foot shocks in a setting in
which none had occurred. Tonegawa, a
neuroscientist at the Massachusetts Institute of
Technology in Cambridge, says that he has no plans
to ever implant false memories into humans;
the study, published last month, was designed
just to offer insight into memory formation.

But the experiment has nonetheless alarmed 
some neuroethicists. “That was a bell-ringer, 
the idea that you can manipulate the brain to 
control the mind,” says James Giordano, chief 
of neuroethics studies at Georgetown Univer- 
sity in Washington DC. He says that the study 
is one of many raising ethical concerns, and 
more are sure to come as an ambitious, multi- 
year US effort to parse the human brain gets 
under way. 

The BRAIN (Brain Research through
Advancing Innovative Neurotechnologies)
Initiative will develop technologies to under- 
stand how the brain's billions of neurons work 
together to produce thought, emotion, move- 
ment and memory. But, along with the dis- 
coveries, it could force scientists and society 
to grapple with a laundry list of ethical issues: 
the responsible use of cognitive-enhancement 
devices, the protection of personal neural data, 
the prediction of untreatable neurodegenera- 
tive diseases and the assessment of criminal 
responsibility through brain scanning. 

On 20 August, US President Barack Obama's 
commission on bioethics will hold a meeting 
in Philadelphia, Pennsylvania, to begin to craft 
a set of ethics standards to guide the BRAIN 
project. There is already one major mechanism 
for ethical oversight in US research: institu- 
tional review boards, which must approve any 
studies involving human subjects. But many 
ethicists say that as neuroscience discoveries 
creep beyond laboratory walls into the market- 
place and the courtroom, more comprehensive 
oversight is needed. “The long-term conse- 
quences of more brain knowledge — whether 
it’s good for an ethnic group or threatens your 
personal identity — there's sort of no one in 
charge of that,” says Arthur Caplan, director of 
medical ethics at New York University’s Lan- 
gone Medical Center. 

Tonegawa’s study adds to the growing evidence
that memories are surprisingly pliable.
In the past few years, researchers have shown
that drugs can erase fearful memories or
disrupt alcoholic cravings in rodents. Some
scientists have even shown that they can introduce
rudimentary forms of learning during
sleep in humans. Giordano says that dystopian
fears of complete human mind control are
overblown. But more limited manipulations 
may not be far off: the US Defense Advanced 
Research Projects Agency (DARPA), one of 
three government partners in the BRAIN 
Initiative, is working towards ‘memory 
prosthetic’ devices to help soldiers with brain 
injuries to regain lost cognitive skills. 

Deep brain stimulation (DBS), in which 
implants deliver simple electrical pulses, is 
another area that concerns neuroethicists. The 
devices have been used since the 1990s to treat 
motor disorders such as Parkinson’s disease, and
are now being tested in patients with psychiat- 
ric conditions such as obsessive-compulsive 
disorder and major depression. Giordano says 
that applying DBS technology more widely
requires ethical care. “We're dealing with
things affecting thought, emotion, behaviour 
— what people hold valuable as the essence 
of the self,” he says.

Neuroethicists are noticing challenges 
beyond the medical system, too, particu- 
larly in the courtroom. Judy Illes, a neurol- 
ogy researcher at the University of British 
Columbia in Vancouver, Canada, and co- 
founder of the International Neuroethics 
Society, says that brain imaging could affect 
the criminal-justice system by changing defi- 
nitions of personal responsibility. Patterns 
of brain activity have already been used in 
some courtrooms to assess the mental fit- 
ness of the accused. Some ethicists worry 
that an advanced ability to map human brain 
function might be used to measure an indi- 
vidual’s propensity for violent or aberrant 
behaviour — or even, one day, to predict it. 

At next week’s meeting, the presidential 
commission will hear from each of the US 
agencies involved in the BRAIN Initia- 
tive — DARPA, the National Institutes of 
Health and the National Science Founda- 
tion — about preliminary scientific plans 
and anticipated ethical issues. Lisa Lee, the 
commission's executive director, says that 
the group plans to discuss broad ethical con- 
cerns for human and animal participants in 
neuroscience research, and also the societal 
implications of discoveries that could arise 
from the BRAIN Initiative. Although no 
specific timeline has been set, the commis- 
sion typically holds three to four meetings 
over a period of up to 18 months, culminat- 
ing in recommendations to the President. 

As neuroethicists wade into the issues, 
they may look to the precedent set by the 
Human Genome Project’s Ethical, Legal 
and Social Implications (ELSI) research 
programme, which has provided about 
US$300 million in study support over 
23 years. The programme raised the profile 
of genetic privacy issues and laid the foun- 
dations for the Genetic Information Non- 
discrimination Act of 2008, which prohibits 
discrimination by employers and health 
insurers on the basis of genetic information. 

Thomas Murray, one of the architects of 
ELSI and president emeritus of the Hast- 
ings Center, a bioethics research institute 
in Garrison, New York, is among the speak- 
ers invited to the commission meeting. He 
considers the BRAIN Initiative a timely 
opportunity to develop an ELSI programme 
for neuroscience. “There will be wonder- 
ful questions about human responsibility, 
human agency,” he says. “It’s never too soon
to begin.”

INNOVATION


BY DECLAN BUTLER 


A Russian revolution in scientific
innovation, or a white elephant?
Bulldozers are rumbling near Moscow, 
at work on the Skolkovo Innovation Centre, 
an ambitious, multibillion-dollar scheme to 
boost Russia’s moribund innovation system. 
Scientists have high hopes for the project’s 
first goal: to build a world-class technology 
university from scratch in a few years. How- 
ever, they are more sceptical about the pros- 
pects of a planned commercial science park 
at the site, and some have baulked at the high 
price — a reported US$300 million — paid to 
a US institution to jump-start the university. 
Meanwhile, allegations of corruption at the 
innovation centre's umbrella body, the Skol- 
kovo Foundation, risk throwing a cloud over 
the entire enterprise. 


The Skolkovo Institute of Science and Technology will open near Moscow next year (artist’s impression).


Russia pins hopes
on science city


But sceptics question prospects for Skolkovo commercial park.


Launched by the Russian government in
2010 with 85 billion roubles (US$2.6 billion) in
state funding until the end of 2014, the Skolkovo
supercampus will rise on a 400-hectare
site just west of the Moscow ring road. On 
1 August, the government announced that it 
intends to put a further 135.6 billion roubles 
into the venture by 2020. Focusing on five areas 
— information technology, nuclear technology, 
energy efficiency, biomedical innovation, and 
space and telecommunications — Skolkovo is 
the boldest of Russia’s efforts to spur high-tech 
innovation and reduce the country’s economic 
dependence on exports of oil, gas and minerals. 
The intellectual lynchpin of the enterprise
is the Skolkovo Institute of Science and
Technology (Skoltech), an elite, English-language,
graduate and research university being created
in partnership with the Massachusetts Institute
of Technology (MIT) in Cambridge. It aims
to foster a new generation of
top-notch Russian researchers and engineers
trained in translational research and entre- 
preneurship as well as basic science, to fuel an 
ecosystem of innovation that the country sorely 
lacks. It plans to recruit 200 full-time faculty, 
300 postdocs and 1,200 students by 2020. 

Coaxing leading scientists to Russia is a chal- 
lenge, so Skoltech is offering compensation 
packages that exceed those available in the West 
“by a wide margin”, says Konstantin Severinov, a
molecular biologist and Skoltech associate dean 
of faculty. Skoltech hopes to have 30-35 faculty 
members by the end of this year, and then to add 
30 or so annually until it reaches its goal. 

The most immediate difficulty is practical: 
Skoltech’s first buildings will become available 
only next year. For now, MIT and other part- 
ners are teaching the first students, who started 
studies last September, and faculty members 
are working elsewhere. The challenge, says Raj 
Rajagopalan, a chemical engineer and Skoltech’s 
dean of faculty, is “how to create an academic 
community when most people are not here”. 

It was a deliberate choice to start Skoltech’s 
activities in parallel with construction, says 
Duane Boning, an electrical engineer and 
MIT’s faculty lead on Skoltech. The plan will 
shave years off the time needed to get the uni- 
versity up and running, he says. It has allowed 
Skoltech to put its administrative, curriculum 
and research strategies in place quickly, adds 
Severinov, who has recruited three postdocs, 
a programmer and one PhD student for his lab, 
with more to come soon. But “we really need 
those buildings here ASAP”, he adds.

The terms of MIT’s four-year contract with
Skoltech, which ends in 2015, are confidential,
but those familiar with the project say that Rus- 
sia is paying MIT at least $300 million for its 
services. Several members of the Skolkovo 
Foundation’s scientific advisory board origi- 
nally objected to that price tag, say sources. 
Some argued that more services could be had 
for less, or from another partner — and others 
felt that Russian institutions should have been 
given a greater role in the project. 

Alexei Sitnikov, vice-president of institu- 
tional and resource development at Skoltech, 
confirms that some board members expressed 
reservations, but he defends the deal. “The 
benefits which we get exceed, by far, the costs 
that we pay,” he says. Several scientists involved 
say that, although the price can be debated, 
MIT has been effective in developing the ele- 
ments needed for the institute to get going: 
curricula, research programmes, administra- 
tive and recruitment structures, and mecha- 
nisms for entrepreneurship and innovation. 
MIT has also run Skoltech’s international calls 
for proposals for research centres of excellence 
(15 are planned). “Sometimes you can pay a lot 
for a brand name,” adds Rajagopalan. After its
contract ends, MIT will shift to a collabora- 
tor role and Russian institutions will be more 
involved in Skoltech, adds Sitnikov. 

Building Skoltech “is exactly the right thing 
to do”, says Leonid Levitov, a Russian physicist
at MIT who is not directly involved in Skoltech’s 
creation. At first, he doubted that MIT should 
take a role, given the political complexity, 
bureaucracy and corruption that exist in Russia. 
But he is impressed at how Skoltech, and
MIT’s contributions to it, are shaping up.
Historically, much of the country’s most
innovative research was linked to government
labs, including those run by the military,
but most have collapsed since the fall of
Soviet communism. These days, there are few
graduate opportunities. Many of the best students
leave Russia, and even more switch to
non-research careers, he says. Skoltech could be a
place for them to train in research at home.
The institute’s ultimate impact will depend 
heavily on reforms elsewhere in Russian 
science, as well as on the success of the Skolk- 
ovo science park. Without it, there will be few 
meaningful career opportunities for the elite 
researchers that Skoltech will produce, and the 
institute could end up as “a completely crazy 
brain-draining scheme”, says Severinov, with
Russia training first-rate people who then leave. 
The park offers companies incentives including
tax breaks and visa help. Twenty-eight
major partners, including Siemens, IBM,
Intel and Microsoft, have pledged to invest 
a total of $500 million, which will be used, 
in part, to establish corporate research and 
development centres at Skolkovo. More than 
960 start-up companies have signed on to set 
up shop there, and around 60 venture-capital 
firms, more than one-third of them from other 
countries, have pledged a total of $600 million, 
says Leonid Gankin, a spokesman for Skolkovo. 

But with construction still under way, start- 
ups will not be able to begin moving to Skolkovo 
until 2014. Only one corporate centre has been
built: that of Cisco Systems, a networking-equipment
firm based in San Jose, California,
which opened its Skolkovo facility in June. A 
report from the European Bank for Reconstruc- 
tion and Development in London last Decem- 
ber warned that although Skolkovo offers 
benefits, initiatives in which states try to pick 
winning industries have a chequered history. 
Skolkovo’s reputation took a hit this year, 
with allegations of corruption at the Skolkovo 
Foundation. In February, the Investigative 
Committee of the Russian Federation, the 
country’s main criminal-investigation authority,
launched enquiries into the alleged embezzlement
of 24 million roubles by Skolkovo officials.
Investigators raided the foundation's Moscow 
offices on 18 April. 

Levitov and other Russian scientists warn 
against jumping to conclusions before the 
charges are substantiated, arguing that in 
Russia, criminal allegations can often mask 
political intrigues and power struggles. Still, 
Rajagopalan worries that some prospective 
faculty members might ask themselves whether 
they really want to take the risk of decamping 
there. “Perceptions count,” he says.


Europe sets sights on lasers 


Three eastern European countries are gearing up to host powerful light sources. 


BY DEVIN POWELL 


Last month, a large European nation
placed an order for a pair of titanium-sapphire
lasers. With an output of
10 petawatts, each would be about nine times 
more powerful than the strongest laser in exist- 
ence today — capable of exploding the nuclei 
of heavy elements. 

Nearly as special as the lasers is the place 
where they will be delivered: Magurele, 
Romania, a town of 11,000 people that was 
once home to a Soviet-era research reactor. In 
June, workers broke ground at the construc- 
tion site for a new research centre to house the 
twin lasers. 

It would not be possible for the former 
Eastern bloc nation to afford the project’s 
€356-million (US$475-million) cost: expen- 
ditures on research and development in 
Romania are among the lowest in Europe. 
Instead, the European Union (EU) is expected 
to pick up most of the tab, as it plans to for 
two other facilities, in Hungary and the Czech 
Republic. The three projects, along with a 
fourth still on the drawing board, will make 
up the Extreme Light Infrastructure (ELI) — 
an experiment in using EU structural funds 
to advance research. 

“With ELI, we enter a new era, pioneering 
the use of structural funds for an interna- 
tional research infrastructure,” says Wolfgang 
Sandner, who heads a consortium set up by the 
three countries and Italy to manage ELI. 

Structural funds, which make up more 
than one-third of the EU budget, are aimed at 
reducing economic disparities across Europe. 
They typically pay for local projects such as 
road repair, construction of power lines or 
clean-up of industrial waste, not super-powerful
lasers. And although any member state can apply
for these infrastructure funds, most of the money
goes to the poorest countries.


GUIDING LIGHTS

The three laser-research sites of the Extreme Light Infrastructure (ELI) rely
on European infrastructure funds typically meant for civic projects.

ELI BEAMLINES
Cost: €278 million
Power: Four lasers, up to 10 petawatts each
Focus: Biomedical applications, materials science, plasma physics
Status: Construction began in October 2012

ELI ATTOSECOND
Cost: €245 million
Power: Multiple lasers, up to 2 petawatts each
Focus: Short pulses for fast snapshots of electrons
Status: Construction expected to begin in October

ELI NUCLEAR PHYSICS
Cost: €356 million
Power: Two lasers, 10 petawatts each
Focus: Nuclear physics
Status: Construction began in June


ELI, by contrast, is an international research
institute that will encourage scientists to do
laser experiments at higher energies, intensities
and speeds. Both France and the United
Kingdom wanted to host the project, marked
as a priority in a 2006 road map for European
research. But structural funds provided the
Czech Republic, Romania and Hungary with
a way to pay for it. Advocates for using the
money in this way touted the potential of ELI
to stimulate economic growth. Romania hopes
to attract technology companies and establish
Magurele as a ‘laser valley’; the region is also
planning to build a smaller laser facility to train
its scientists to participate in ELI.

But the structural funds come with strings 
attached. For starters, the money must be 
spent by 2015, and meeting that deadline is 
proving difficult. After Romania was selected 
for ELI in 2009, it took two years to decide 
how to shift structural funds away from other 
projects, delaying the start of construction. 
And Hungary submitted a formal funding 
proposal only in May. That request has yet to 
be approved. 

SOURCE: EUROPEAN COMMISSION/ELI

Using structural funds to buy high-end
lasers has also been tough, says Vlastimil
Ruzicka, managing director of the Czech 
facility, to be built near Prague. That is partly 
because these funds usually go towards civic 
projects and products with clear prices and 
capabilities. Working out the uncertainties 
involved in developing cutting-edge lasers has 
required protracted negotiations with labora- 
tories. “Included in the contracts are penalties 
for not delivering,” says Ruzicka, who has spent
nine months working out a deal to buy a laser 
from the Lawrence Livermore National Labo- 
ratory in Livermore, California. 

The three countries have asked the European 
Commission to approve a phased approach that 
would bring some lasers online in 2015 and 
others in 2017. Adrian Curaj, general director 
of Romania's Ministry of Education, Research 
and Youth in Bucharest, says he is confident 
that this approach will pan out. “Everyone 
wants a success story here,” he says.

Each branch of ELI will specialize in a 
different kind of laser science (see ‘Guiding 
lights’). Romania's intense lasers will explore 
nuclear physics, and the Czech lasers will 
accelerate particles to make X-ray and ultra- 
violet light sources, useful in exploring the 
structure of molecules and solid materials. 
One laser will fire exceptionally rapid pulses
to excite electrons in a plasma. Those electrons
then shed that energy, emitting a
laser-like, or coherent, beam of X-rays.
Most sources of coherent X-rays, such 
as the European X-ray free-electron laser 
(XFEL), under construction near Hamburg, 
Germany, use microwaves to accelerate elec- 
trons over a long distance — 1.7 kilometres, 
in the case of the XFEL. By contrast, ELI’s
compact X-ray laser technology will fit
on a tabletop in a small laboratory.
Its designers hope that a successful [...]
of laser-based X-ray
sources. The pulses 
that ELI makes will 
be complementary to those at XFEL — less 
energetic but spread over a broader spectrum 
— and will be useful for probing different 
kinds of atoms in a material simultaneously, 
for instance. 

Hungary’s lasers, to be housed in a former 
military barracks in the town of Szeged, will 
also produce X-rays. But here the goal will 
not be to make pulses coherent, but to make 
them as short as possible — each lasting less 
than one-quadrillionth of a second. At this 
time scale, the pulses can excite a target atom’s 
electrons, which can then be tracked by the
X-rays. “Each time laser pulses have gotten
shorter, a new physical regime has been 
accessed and we have learned new things,” 
says Todd Ditmire, director of the Texas Peta- 
watt Laser project at the University of Texas at 
Austin and chief executive of National Ener- 
getics, a laser company based in Austin that 
has plans to bid on building one of Hungary's 
lasers. 

As the ELI trio takes the stage, a fourth ELI 
project waits in the wings: a facility for relativ- 
istic physics, underpinned by an even mightier 
laser that would reach 200 petawatts of power. 
This titan’s beam would be strong enough to 
distort empty space — enabling researchers 
to test the idea that a vacuum can behave like 
a crystal and bend light. The powerful beam 
could probe the nature of virtual particles as 
they pop in and out of the void and search for 
radiation that is thought to be given off by an 
object accelerating through a vacuum. 

How to make that laser is still an open ques- 
tion — as is where to build it and how to pay 
for it. The fourth site might go to one of the 
countries that is already participating. Another 
option is Russia, which has proposed building 
a ‘super-laser’ from 12 smaller ones. “It was, 
and remains, ELI’s strategy to first gain some 
experience about the feasibility of different 
technologies at the three existing pillars before 
narrowing down the options,” says Sandner.

PHARMACEUTICALS


India spurns cancer patents 


Nation seeks to cap high cost of drugs to treat non-infectious diseases. 


BY ERIKA CHECK HAYDEN 


Once the scourge of the developing
world, infectious diseases such as
malaria, tuberculosis and AIDS can
now be fought with cheap drugs. But as people
in poorer nations live longer and adopt
Western habits, non-communicable diseases 
such as heart disease, diabetes and cancer 
have become the main killers — and pay- 
ing for their treatment has become a thorny 
problem. 

India may now be drawing a line in the 
sand. In the past three weeks, officials there 
have refused patents on two breast cancer 
drugs — the latest in a series of decisions to 
limit patents on pricey brand-name medica- 
tions. These moves reflect a tension: India 
now surpasses the United States in terms of 
annual cancer deaths, and wants to find ways 
to treat the disease cheaply. But this desire 
runs counter to the goals of drug makers, who 
see middle-income nations as central to their 
growth plans. 

The first of the recent rejections occurred 
on 27 July, when an Indian federal board of 
patent officials revoked a patent on a slightly
modified version of the breast cancer drug 
lapatinib, sold as Tykerb by London-based 
pharmaceutical firm GlaxoSmithKline. 
Then, on 4 August, Swiss drug company
Roche reported that a patent office in the
city of Kolkata, a hub of the national patent
system, would not grant patents on
a version of the company’s drug trastuzumab,
sold as Herceptin.
Indian officials allowed other patents that 
will protect both drugs from generic com- 
petition until 2019. But the rulings will stop 
the companies from extending their patent 
protection beyond that date, opening a win- 
dow for manufacturers of generic drugs to 
then step in. 

The fight echoes one in the late 1990s and 
early 2000s over drugs for treating infections 
such as HIV. That dispute was largely resolved 
when drug makers allowed developing-world 
companies to create cheap generic medicines. 
Today, antiretroviral treatments can be bought 
for less than US$100 a year, compared with 
more than $10,000 a year in 2000, according to 
international aid organization Doctors Without
Borders (Médecins Sans Frontières), based
in Geneva, Switzerland. 


DRUG MONEY


By 2016, drug makers could be earning as much
from emerging markets, such as China, Brazil and
India, as they do from selling in the United States.

[Chart: series shown for United States, Europe, Rest of world and Emerging markets.]

Emerging markets: China, Brazil, India, Russia, Mexico, Turkey,
Poland, Venezuela, Argentina, Indonesia, South Africa, Thailand,
Romania, Egypt, Ukraine, Pakistan and Vietnam.


But drugs for non-communicable diseases 
— particularly cancer — will be much trickier 
to negotiate. “There’s no easy compromise 
that’s going to arise around non-communi- 
cable diseases the way that we saw around 
HIV,” says Thomas Bollyky, a lawyer with the
Council on Foreign Relations in New York 
who fought for affordable HIV medicines in 
the 1990s. 

In India, a $15,000 course of trastuzumab 
can cost more than ten times the average 
annual wage. And there are no older, off-pat- 
ent drugs that could serve as an alternative, 
because none of them target the specific type 
of breast cancer as well as trastuzumab. 

Yet drug makers are reluctant to cut prices in 
middle-income countries such as India, China 
and Brazil, which are projected to account for 
much of the industry’s growth in the near 
future (see ‘Drug money’). Although Africa's 
ability to pay for HIV drugs was never going 
to be high, some people in middle-income 
nations can afford expensive medicines. Drug 
makers do not want to erode that niche mar- 
ket through lower-cost drugs, even if the vast 
majority of people in need cannot pay, says 
James Love, director of Knowledge Ecology 
International, a non-governmental organiza- 
tion in Washington DC that advocates for social 
justice in access to knowledge. 

A number of ideas to skirt the impasse 
have been floated, but none are simple. Drug 
makers argue that governments of middle-income
nations should broaden insurance
programmes and access to health care. They
also argue that drugs will become more 
affordable as economies grow and people 
earn more money. By reducing support for 
intellectual property, India is undermining 
incentives for drug development and foreign 
investment that will allow for growth, says 
Amy Hariani, director and legal policy coun- 
sel for life sciences at the US-India Business 
Council, an industry group based in Wash- 
ington DC. “The best way for the Indian 
economy to grow is by rewarding innovation,”
she says. 

Another idea comes from the World Health 
Organization, which for the past five years has 
been trying to broker an international treaty 
that would see member states supporting 
the development of lower-cost medicines 
with prizes and research funding rather than 
patents. “We think the answer is to make the 
price of drugs really cheap, and to provide 
funding as a reward for innovation rather 
than through a monopoly on a drug,” says
Love. 

There is also increasing pressure on drug 
companies to adopt pricing models that allow 
people in the same country to be charged dif- 
ferent prices for drugs, depending on their 
ability to pay. Companies, including Roche in 
the case of trastuzumab, say that they already 
offer such differential pricing through special 
access programmes. Still, Roche’s own figures 
show that it sold enough trastuzumab last 
year to treat only 3,700 Indian breast cancer 
patients — 15% of those who need it. 

The battles may end up being mere skir- 
mishes if India goes further and allows local 
companies to disregard Roche's trastuzumab 
patent altogether and manufacture a cheaper 
generic version, using a ‘compulsory licence’.
Last year, India issued such a licence on a 
cancer drug sold by German firm Bayer. And 
in January, India’s ministry of health recom- 
mended compulsory licences for trastuzumab 
and two other cancer drugs. 

Indonesia issued compulsory licences for 
seven drugs in 2012, and China and the Phil- 
ippines have tweaked their laws to make such 
licences easier to issue. Prashant Yadav, direc- 
tor of the health-care research initiative at the 
University of Michigan in Ann Arbor, says 
that these moves portend an unsettling future. 
India may be the main battleground today, but 
the war over cancer-drug access seems likely 
to bleed beyond its borders unless a compro- 
mise is reached. “This requires some kind of 
diplomacy now,” says Yadav.


SOURCE: IMS INSTITUTE FOR HEALTHCARE INFORMATICS 


Elizabeth Loftus has spent decades exposing flaws in eyewitness testimony. 
Her ideas are gaining fresh traction in the US legal system. 


In the early hours of 9 September, 1984,
a stranger entered Mrs M's California
home through an open living-room
window. Finding Mrs M asleep, he tried
to rape her, but fled when other people
in the house awoke. Mrs M described
her assailant to the police: he was black, weigh- 
ing about 170 pounds and 5’7” to 5’9” tall, with 
small braids and a blue baseball cap. 

Officers cruising her neighbourhood spot- 
ted someone roughly matching that descrip- 
tion standing beside his car a block away from 
the house. The man, Joseph Pacely, said that his 
car had broken down and he was looking for 
someone to jump-start it. But Mrs M identified 
him as her attacker and he was charged. 

At Pacely’s trial a few months later, mem- 
ory researcher Elizabeth Loftus testified on 
his behalf. She told the jury how memory is 
fallible, how stress and fear may have impaired
Mrs M’s ability to identify her assailant, and
how people can find it difficult to identify
someone of a race other than their own.

Pacely was acquitted. “It’s cases like this that 
mean the most to me,” says Loftus, “the ones
in which I play a role in bringing justice to an 
innocent person.” 

In a career spanning four decades, Loftus, 
a psychologist at the University of California, 
Irvine, has done more than any other 
researcher to document the unreliability of 
memory in experimental settings. And she 
has used what she has learned to testify as an 
expert witness in hundreds of criminal cases — 
Pacely’s was her 101st — informing juries that 
memories are pliable and that eyewitness 
accounts are far from perfect recordings of 
actual events. 

BY MOHEB COSTANDI


Her work has earned her plaudits from her
peers, but it has also made her enemies. Critics
charge that in her zeal to challenge the veracity
of memory, Loftus has harmed victims and
aided murderers and rapists. She has been sued
and assaulted, and has even received death
threats. “I went to a shooting range to learn how 
to shoot,” she says, noting that she keeps a few
used targets in her office as a point of pride. 
Now, the 68-year-old scientist’s research is 
starting to bring about lasting changes in the 
legal system. In July last year, the New Jersey 
Supreme Court issued a ruling — based largely 
on her findings — that jurors should be alerted 
to the imperfect nature of memory and the fal- 
libility of eyewitness testimony as standard pro- 
cedure. Loftus is working with judges in other 
states to make such changes more widespread. 
“What's going on now in America really is 
something of a revolution,” says Martin Conway,
a cognitive psychologist at City University Lon- 
don. Loftus’ work, he says, has been “profoundly 
important” in shaping these changes. 


MALLEABLE MEMORIES 

Loftus says that her start in psychology was 
rudderless. As a graduate student in math- 
ematical psychology at Stanford University 
in California, “I wasn't really captivated”, she 
says. “I’d sit in the back of the seminars, kind of
bored, writing letters to my Uncle Joe or hem- 
ming skirts, or whatever.” 

Eventually a social-psychology class piqued 
her interest and she began to study how word 
meanings are stored in the brain, and how peo- 
ple recall them. Still, something was missing. 
“One day I was having lunch with a cousin of 
mine,” she says, “and I told her about our great
discovery that people are faster at naming ‘a bird
that’s yellow’ than ‘a yellow bird’.” Her cousin —
unimpressed — joked about taxpayers’ money 
being wasted. “That’s when I decided I wanted 
to work on something that had more practical
applications.”

Loftus was casting about for a meaningful
way to study memory and get funding when a former Stanford
engineer working for the US Department of 
Transportation said that his employer would 
probably pay for research into car accidents. 

Following that lead, Loftus won funding in 
1974 for a proposal to study witness accounts 
of accidents, and she soon published the first of 
several influential studies revealing the limitations
of eyewitness testimony. She showed people
film clips of car accidents and asked them to
estimate the speed of the cars. The wording of 
the questions, she found, had a profound effect 
on the estimates. People who were asked, “How 
fast were the cars going when they smashed into 
each other?” gave higher estimates on average 
than those with whom the verb ‘hit’ was used. 
And those who were told that the cars had ‘con- 
tacted’ each other gave the lowest estimates. 

Those asked about cars smashing into one 
another were more than twice as likely as oth- 
ers to report seeing broken glass when asked 
about the accident a week later, even though 
there was none in the video. “I realized that 
these questions were conveying information,” 
says Loftus. “I began to think of it as a process 
of memory contamination, and we eventually 
called it the misinformation effect.” 

She went on to publish several other studies
showing how memories can be contorted, and 
that the ability of eyewitnesses to identify sus- 
pects from photographs can be unreliable. Any 
description they might hear has the potential 
to influence who or what they think they saw. 

Loftus was eager to translate these findings 
to the real world, and began consulting on legal 
cases to get “close up and personal” with wit- 
nesses. Her first case — of a woman accused 
of killing her abusive boyfriend — hinged on 
whether the woman had acted in self-defence or 
had committed premeditated murder. Eyewit- 
nesses could not agree on how much time had 
elapsed between when the defendant had picked 
up the gun and when she had fired it: some said 
it was seconds, others said minutes. Loftus cast
doubt on the memory of the witnesses, and the 
woman was acquitted. 

Loftus described the case, together with 
her research, in a 1974 article for Psychology 
Today magazine. “Once that article came out,
I started getting calls from all over the place,” 
she says. “From lawyers wanting me to work on 
their cases, and legal professionals wanting me 
to lecture at their meetings.” 

Some of her trials have been high-profile 
— including that of the serial killers known as 
the Hillside Stranglers and the 1992 trial of the 
police officers indicted for beating construc- 
tion worker Rodney King. She even consulted 
on a case involving a young law student named
Ted Bundy, who was accused of kidnapping a 
woman in 1974. Bundy was convicted, only 
to escape. Recaptured in 1978, he eventually 
admitted to killing 30 people. 

The possibility of aiding guilty people does 
not faze Loftus. “I haven't had a situation 
where someone was acquitted because of my 
testimony and then went on to commit some 
awful crime,” she says. “I would feel horrible if
that happened, but I’m only one small piece of
a court case.” She is often compensated for her
expert-witness work, earning up to US$500 
per hour, she says. 

Nita Farahany, a bioethicist at Duke University
School of Law in Durham, North Carolina,
says that Loftus’s activism in the court is not 
unique, and that testifying on behalf of unpop- 
ular defendants is important. “It shows that she 
has tried to be truly impartial, and that her goal 
is to try to provide an accurate understanding 
of the science, no matter who is involved.”

Still, Loftus has drawn the line at some 
defendants, such as John Demjanjuk, who in 
1988 stood accused in Israel of being ‘Ivan the 
Terrible’, a guard who operated gas chambers
at the concentration camp Treblinka in Poland 
during the Second World War. Loftus, herself 
Jewish, declined to testify because she worried 
that it would upset family and friends. 

The case led some to accuse her of double 
standards. But those criticisms were mild com- 
pared with the reactions that she would soon 
trigger in her most controversial legal work. 


DIGGING UP THE PAST 


In 1990, Loftus got a call from a California 
attorney defending George Franklin, whose 
daughter claimed that during therapy, she had 
recovered decades-old memories of him mur- 
dering her friend, Susan Nason. Loftus decided 
to consult for the defence team. “I thought it 
was pretty fishy and started looking into the 
literature,” she says. She found little convincing
research to support the idea that traumatic
memories could be repressed for years. 
Franklin was convicted despite her testimony. 
He spent five years in prison before an appeals 
court reviewed and then overturned his convic- 
tion amid doubts over his daughter's statements. 
The courts went on to see a surge in cases 
based on recovered childhood memories,
fuelled in part by popular books and high- 
profile accusations. Loftus began to wonder 
whether it was possible to fabricate complex, 
believable memories. “I wanted to see if we 
could implant a rich memory of an entirely 
made-up event,” she says. An idea eventually
came to her as she drove past a shopping mall. 

Working with a student, Jacqueline Pickrell, 
Loftus recruited 24 people and, with the 
cooperation of family members, presented them 
with four detailed accounts of events from their 
childhood. Three of the incidents had actu- 
ally taken place, but the fourth — a dramatic 
account of being lost in a mall — was entirely 
concocted by Loftus and corroborated by the 
participants’ relatives. One-quarter of the participants
claimed to remember the false event.


BATTLE GROUND 

Loftus became convinced that well-meaning 
psychotherapists could inadvertently implant 
false memories into patients’ minds, and her 
subsequent testimonies led to a row between 
therapists who believed their patients were 
recovering lost memories and researchers who 
thought something else was afoot. To try to settle
these ‘memory wars’, the American Psychological
Association (APA) commissioned an
expert report about the subject, to be written 
by three memory researchers, including Lof- 
tus, and three clinical psychologists. 

The groups could not agree, and each ended 
up writing a separate report. “It was very polarizing,”
says Stephen Ceci, a developmental psychologist
at Cornell University in Ithaca,
New York, who worked with Loftus on one 
of the reports. 

There are ways in which traumatic 
memories of real events can be recalled after 
being buried for years, he adds, but without 
hard evidence, it is impossible to distinguish 
false memories from real ones in court. It 
is, therefore, possible that some claims of 
childhood abuse go unvindicated because of 
Loftus’ testimony, and this is the cause of much 
of the hostility towards her. 

Ross Cheit, a political scientist at Brown Uni- 
versity in Providence, Rhode Island, started the 
Recovered Memory Project in 1995 to docu- 
ment and respond to what he says has been a 
one-sided debate. There are now more than 
100 corroborated cases of recovered memory on 
his website (http://blogs.brown.edu/recovered- 
memory), he says, including some on which 
Loftus had consulted. 

“Loftus is often on the losing side, and she’s 
sometimes wrong in a spectacular way,” Cheit
says. Her testimonies, he adds, can be psycho- 
logically damaging for the victims. “If you’re 
telling someone you think their memories are 
false, when they have corroborating evidence 
that they were abused, that’s corrosive.” 

Loftus does not believe that Cheit’s site cor- 
roborates recovered memories. “He might have 
some cases of people who didn’t think about
their abuse for some time and were reminded
of it, but as for actual repression, no,” she says.
“I cringe at the idea of hurting genuine victims, 
but when an innocent person is accused, we 
have a whole new set of victims, and I’m more
horrified by an innocent person getting convicted
than by a guilty person being acquitted.”

But her testimonies and investigations into 
recovered memories have strained her profes- 
sional relationships. Towards the end of 1995, 
two women filed formal complaints against 
Loftus with the APA. Lynn Crooks and Jennifer 
Hoult had won civil suits in cases involving 
recovered memories of childhood sexual abuse, 
and both claimed that Loftus had distorted the 
facts of their cases in articles and interviews. 
Loftus resigned from the APA and critics specu- 
lated that she had caught wind of the complaints 
and left before a formal investigation could take 
place. But Loftus chalks her resignation up to 
political disagreements, saying she knew noth- 
ing of the complaints at the time. 

In 1997, Loftus and several colleagues began 
to dig into a published case study describing 
an anonymous subject, ‘Jane Doe’, who had
apparently recovered a repressed memory of
childhood abuse. They found information
that cast doubt on her account, but before they 
could publish, Doe contacted the University 
of Washington in Seattle, where Loftus was 
working, and accused the team of breaching 
her privacy. 

The university confiscated Loftus’s files, put 
her under investigation for nearly two years and 
prevented her from publishing. She was eventually
cleared, and published the work in 2002.
The next year, however, Doe sued Loftus and 
her collaborators for fraud, invasion of privacy, 
defamation and causing emotional distress. 

It was at around that time that Loftus moved 
to the University of California, Irvine. The Jane 
Doe case was eventually settled in 2007, when 
the Supreme Court of California dismissed all 
but one of the charges and Loftus agreed to pay 
a nuisance settlement of $7,500. “It was such a
stressful time, but I can’t really say it was detrimental
overall,” says Loftus.

Her work has now moved from trying to 
affect single cases to pushing for broader 
changes in the legal system. Loftus has been 
working with Pennsylvania trial judge Jeannine 
Turgeon to compile a set of guidelines similar 
to those instituted in New Jersey last year. They 
instruct jurors that memory “is not like a video 
recording” and ask them to consider the many 
factors that can alter memories, such as the pres- 
ence of a weapon, which can draw attention 
away from the perpetrator’s face.

“This has the potential to be really impor- 
tant,” says Farahany. “Using cutting-edge 
research to undercut the idea that memory is as 
stable and precise as people believe it to be can 
really help us get to a place where we have better
truth-seeking in criminal cases,” she says.

Loftus wants to go further. Almost every 
stage of the legal process — from the identi- 
fication and questioning of suspects to cross- 
examination of eyewitnesses in the courtroom 
— is prone to error. In a line-up, for example, 
police officers can influence identification, but 
this can be avoided if someone who does not 
know the identity of the prime suspect con- 
ducts the line-up (see Nature 453, 442-444; 
2008). “I’d like to see this kind of thing being
implemented, and to keep educating people
about the workings of memory,” says Loftus.


MIND CONTROL 

Meanwhile, her research has shifted into new 
controversial waters. Taking on board the les- 
son that memories can be manufactured, she 
has been investigating the possibility of using 
those memories to modify behaviour.
“We've shown that you can plant a memory of 
getting sick eating particular foods as a child,” 
she says, “and we can get people thinking they 
got sick drinking vodka, so they don’t want to 
drink as much of it later on.”

There is no evidence that any of this will 
successfully transfer from the lab to the real 
world. Even if it does, it would violate thera- 
pists’ code of conduct, and could have unfore- 
seen consequences. 

“Lying to children is a slippery slope that 
makes me uncomfortable,” says Judy Illes,
a neuroethicist at the University of British 
Columbia in Vancouver, Canada. “Can't 
we alter their behaviour in a positive way, 
instead of using subterfuge?” But Loftus dis- 
misses the concerns, suggesting that even if 
therapists cannot do it, parents might want 
to. “Parents lie to their kids all the time, about 
Santa Claus and the tooth fairy. Would you 
rather have an unhealthy kid, or one with a few 
false memories?”


Moheb Costandi is a freelance science writer 
in London. 
The unique Florida Everglades deserve protection, despite being highly vulnerable to climate change.


Climate change must not 
blow conservation off course 


Reconfiguring protection priorities around global warming could be of limited use or 
even harmful, say Morgan W. Tingley, Lyndon D. Estes and David S. Wilcove. 


In March this year, the United States
released its national strategy for conservation
planning in the face of climate
change. The first goal of the National Fish,
Wildlife and Plants Climate Adaptation
Strategy is to “conserve habitat to support
healthy fish, wildlife, and plant populations
and ecosystem functions in a changing
climate”. It warns that by prioritizing
certain species and areas over others, there
will be winners and losers. Elsewhere in the
conservation community, there is increasing
acceptance of abandoning protection
for many of the species, populations and
ecosystems that are most at risk of extinction
from climate change.

Although climate change seems likely to 
wreak havoc on biodiversity, as is already 
happening in some places, its precise effects 
are difficult to predict. In the meantime, 
numerous threats that are better understood 
and more immediate — notably, the direct 
destruction of habitats — continue to drive 
species towards extinction. 

The best conservation response to global
warming is not to beat an orderly retreat
while saving the strongest, but to consider
climate change as one of a suite of maladies,
all of which must be addressed to protect
biodiversity. In some cases climate change
may be the most urgent threat; in most cases
it is not.


MOUNTAIN BIAS 

In the growing literature on conservation 
and climate change, the golden word is ‘resilience’.
Conservationists hope to take actions
that help organisms and ecosystems to sur- 
vive in a warmer and more volatile world. 
Because the resilience of individual
species to climate change is difficult to pre- 
dict, researchers have suggested prioritizing 
regions that are expected to be climatically 
stable for at least the next 100 years. In the
past five years, scientists have undertaken 
global and regional mapping of climate-change
velocity to identify these ‘refugia’.

Climate-change velocity indicates how 
fast (for instance, in kilometres per year) and 
in what direction temperatures are shifting 
across a landscape. Velocity calculations 
suggest that in a changing climate, topo- 
graphically diverse areas such as mountain 
chains will be more climatically stable than 
less-complex terrain such as flatlands. In 
mountainous areas, a wide range of tem- 
peratures occurs in a relatively small area. 
Thus, the distance an organism must migrate 
to remain at a constant temperature in a 
changing climate is much shorter in rugged 
country than in flat terrain. 
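
The contrast between rugged and flat terrain can be sketched numerically. The short example below assumes the commonly used definition of climate-change velocity as the rate of warming over time divided by the change in temperature per kilometre across the landscape; that formula is not spelled out in the text above, and the warming rate and gradients are hypothetical values chosen only for illustration.

```python
def climate_velocity_km_per_year(warming_c_per_year, spatial_gradient_c_per_km):
    """Speed at which a fixed temperature shifts across a landscape.

    Assumed definition (not stated in the article): temporal warming rate
    divided by the spatial temperature gradient.
    """
    if spatial_gradient_c_per_km <= 0:
        raise ValueError("spatial gradient must be positive")
    return warming_c_per_year / spatial_gradient_c_per_km


# Hypothetical inputs: the same warming rate applied to two kinds of terrain.
WARMING_C_PER_YEAR = 0.03

# Steep terrain: temperature changes a lot over a short distance.
mountain_velocity = climate_velocity_km_per_year(WARMING_C_PER_YEAR, 3.0)

# Flat terrain: temperature barely changes from one kilometre to the next.
flatland_velocity = climate_velocity_km_per_year(WARMING_C_PER_YEAR, 0.01)

if __name__ == "__main__":
    print(f"mountain: {mountain_velocity:.3f} km per year")
    print(f"flatland: {flatland_velocity:.3f} km per year")
```

With these made-up numbers, an organism tracking a constant temperature would have to move far faster across the flatland than up the mountain slope, which is why velocity maps tend to flag mountains as stable.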

Our concern is that using metrics such as 
climate velocity to guide conservation prior- 
itization may well prove ineffective, or even 
harmful, given the enormous uncertainty 
over how species will actually respond to 
climate change. Meta-reviews assessing the 
impact of global warming on hundreds of 
species around the world can give the impres- 
sion that organisms are uniformly marching 
uphill or to higher latitudes in step with 
changes in mean temperature. But averaging
behaviour across diverse species to find over- 
all trends may be of little use in predicting 
what will actually happen in any one location. 

In fact, the closer scientists look at species’ 
ongoing responses to our warming planet, 
the more surprises they uncover. For
instance, the ranges of only 51% of the bird 
species in California’s Sierra Nevada moun- 
tains moved to higher elevations between 
1911 and 2009, despite a 1-2°C mean temperature
increase over that period. The
remaining species either shifted to lower 
elevations or did not move. 

Basing conservation priorities on an area's 
expected resilience to global warming, and 
thus biasing protection towards mountain- 
ous landscapes, risks potentially losing the 
diversity of flatlands. On the whole, flatter 
ecosystems tend to be in greater need of pro- 
tection than rougher terrain; national parks, 
for example, more commonly exist in moun- 
tainous areas because such regions are less 
suited to cultivation and pasturing.

Prioritizing conservation according to 
an area’s expected resilience at a national or
continental scale could mean favouring the 
Rocky Mountains, the high Andes or Mount 
Kenya over Florida’s fabled Everglades, 
the cerrado of Brazil or Africa's savannahs. 
Although topographically diverse areas may 
contain larger numbers of species per unit 
area than flatlands, many areas of flat terrain 
are just as important for endemic species. The
Brazilian cerrado, for example, comprises
more than 2 million square kilometres and
has one of the richest endemic floras in the
world. Yet only around 2% of it is currently
protected, and around half the total area has
already been cleared for agriculture.

Brazil’s cerrado is a tropical savannah that has one of the richest endemic floras in the world.


THREAT BY THREAT 

To move forward, conservationists should 
consider all threats to biodiversity when 
deciding which species, habitats or areas to 
protect, and should not assume that some 
threats are more important than others. For 
example, for any conservation area, each 
potential threat should be evaluated and 
weighted by the risk it poses, with full con- 
sideration of its severity, reversibility, immi- 
nence, pace and certainty. 

Finding ways to combine these factors 
into a meaningful metric remains a for- 
midable challenge. Various methods exist 
to help conservationists to factor climate- 
change uncertainties into their priority- 
setting, but as yet there is no consensus 
on how the future threat of climate change 
should be compared to ongoing and more 
certain threats, such as land-use change. 
Creating a consistent approach for prioritization
should be a primary goal. The International
Union for Conservation of Nature
Red List of Threatened Species could be a 
good model, because it simultaneously 
evaluates the risk of multiple threats, from 
pollution to invasive species. 

In the meantime, there are several ways for 
conservation planners to guard against over- 
compensating for climate threats. When it is 
possible to assess the uncertainty associated 
with a particular threat — a 20% reduction 
in rainfall by 2050, say — this uncertainty 
can be factored into prioritization schemes. 
In other situations, models predicting an 
optimal prioritization scheme can be run 
several times, with and without different 
threats incorporated. Basing prioritization 
on results obtained from a range of models,
each with different parameters, is more likely 
to result in smart conservation. 
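
One way to picture the multi-model approach is the minimal sketch below. It is not a method proposed in this article: the area names, risk scores and the rule of summing risks and averaging ranks are all hypothetical, chosen only to show how running a prioritization with and without particular threats can be combined into one ordering.

```python
from statistics import mean

# Hypothetical risk scores (0 to 1) for each area and threat.
AREAS = {
    "flat_savannah": {"land_use_change": 0.9, "climate_change": 0.4},
    "mountain_range": {"land_use_change": 0.2, "climate_change": 0.3},
    "coastal_wetland": {"land_use_change": 0.5, "climate_change": 0.9},
}

# Each tuple is one model run, listing the threats that run includes.
THREAT_SETS = [
    ("land_use_change",),
    ("climate_change",),
    ("land_use_change", "climate_change"),
]


def rank_areas(areas, threats):
    """Return {area: rank}; rank 1 has the highest summed risk for these threats."""
    scores = {
        name: sum(risks.get(threat, 0.0) for threat in threats)
        for name, risks in areas.items()
    }
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {name: position for position, name in enumerate(ordered, start=1)}


def ensemble_priorities(areas, threat_sets):
    """Order areas by their mean rank across all model runs (most urgent first)."""
    runs = [rank_areas(areas, threats) for threats in threat_sets]
    mean_rank = {name: mean(run[name] for run in runs) for name in areas}
    return sorted(areas, key=mean_rank.get)


if __name__ == "__main__":
    print(ensemble_priorities(AREAS, THREAT_SETS))
```

Summed risk is a deliberately crude stand-in for a real prioritization model. The point of the sketch is only that an area ranked highly in most runs stays a priority even when any single threat is left out.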

Given the sobering picture that has long 
been emerging from studies and reports 
such as those produced by the Intergovern- 
mental Panel on Climate Change, it is not 
surprising that conservationists feel the need 
to alter course to deal with the threat of cli- 
mate change. Indeed, some may argue that 
there is little point in trying to save species 
from immediate threats when their long-term
survival is in question. But if conservation
planners don’t use resources efficiently
to maximize the protection of biodiversity 
and habitats from today’s threats, there may 
be little left to protect from the effects of 
global warming 50 to 100 years from now.
The Bet examines
contrasting views of the 
future, as in this 1980s 
New Yorker cartoon. 
ENVIRONMENTAL POLICY


The biggest wager 


Jon Christensen unpacks the fraught story of a biologist, an economist, and the 
polarization of US environmental policy. 


In 1980, economist Julian Simon challenged
biologist Paul Ehrlich to bet
on the future price of a basket of raw
materials then worth US$1,000. Ehrlich and
two of his colleagues chose five metals crucial
to the economy at the time: chromium, copper,
nickel, tin and tungsten. Ehrlich bet that
prices would rise because of increasing scarcity
and demand. Simon bet that they would
not. The loser would pay the difference in
price after a decade. Simon was at greater
risk: prices could have risen indefinitely. But
Ehrlich lost: in October 1990, he sent Simon
a check for $576.07. End of story, right?

Not so fast, argues historian Paul Sabin,
who in The Bet attempts to use their wager
to narrate parallel biographies of Simon
and Ehrlich, as well as a US political environmental
history of the past half-century.
The result is a revealing tale. We see the
evolution of Simon’s ‘cornucopian’ view of
ongoing population growth supported by
human ingenuity; his open conflict with
Ehrlich’s steadfast ‘neo-Malthusian’ argument
that there are real ecological limits
to growth; and how both standpoints have
influenced US environmental politics. So
extreme were their stances that they could
be viewed as fomenting a “bitter contest
over the future”, Sabin writes.

For instance, Ehrlich famously predicted 
in his 1968 book The Population Bomb 
(Ballantine Books) that hundreds of mil- 
lions would starve to death in the coming 
years. A year later he said, “By the year 
2000 the United Kingdom will be simply 
a small group of impoverished islands, 
inhabited by some 70 million hungry 
people ... If I were a gambler, I would take
even money that England will not exist in 
the year 2000.” By contrast, Simon viewed 
the population explosion as “a triumph 
for mankind”. Humanity, Simon believed, 
was “the ultimate resource”. He held that 
human enterprise would continue to solve
emerging problems and improve life on
Earth, and as resource supplies diminished,
prices would rise, driving discovery of
more reserves or the creation of substitutes.

Paul Ehrlich.

Julian Simon.

The Bet: Paul Ehrlich, Julian Simon, and Our Gamble over Earth’s Future
PAUL SABIN
Yale University Press: 2013.

In The Bet, Sabin traces these competing 
ideas through the energy crisis of the late 
1970s, attempts at US immigration reform 
in the 1980s and the stand-off over climate 
change in the 1990s. He sees echoes of the 
conflict embodied in Simon and Ehrlich’s 
wager in Jimmy Carter's loss to Ronald Rea- 
gan in the 1980 presidential election, the 
environmental movement's ongoing lack 
of a unifying leader, and even the paralys- 
ing political divide over climate change in 
the United States today. Ultimately, the bet 
is used to explain the whole messy evolution 
of US environmental politics from the early 
1970s, when Republican Richard Nixon was 
an environmental champion, to today, when 
Republican environmentalists are an endan- 
gered species. 
Sabin recognizes the complexities of the 
larger political story he tries to tell. He was 
a participant on its edges as director of the 
non-profit Environmental Leadership
Program before joining the history fac- 
ulty at Yale University in New Haven, 
Connecticut. And he acknowledges 
that there are other explanations for the 
divide in thinking on US environmental 
policy: the shift of the Democratic and 
Republican parties to more ideologically 
consistent blocs, business-led backlashes 
against environmental regulation, and 
the success of conservative think tanks 
in staking out positions in public- 
policy debates on these issues. But he 
wants to convince 
the reader that 
the clash is not 
played out only in 
the corridors of 
power. The wager, 
he asserts, “stands 
for much, much 
more — our col- 
lective gamble on 
the future of humanity and the planet”. 

Sabin uses the bet as a synecdoche — 
a narrative device in which a part stands 
for a whole, in this case environmen- 
tal politics. In the end, this is simplistic 
and blurs cause and effect, explanation 
and interpretation. Sabin bemoans the 
polarization that is the very structure of a 
wager, as if it were an important cause of 
the larger divide. But bets can be useful 
— they can clarify what is at stake and, by 
doing so, help us to frame thinking about 
the future of the planet. 

On a personal level, this is a sad tale 
of two very smart men who talked past 
each other for years, until in one telling 
moment they put their convictions on 
the line. After their wager was decided, 
they descended into ad hominem attacks. 
It is depressing to think that Sabin might 
be partly justified in making this a meta- 
phor for the environmental politics of 
our age. 

On the other hand, there is the story 
of John Holdren, a secondary character 
in The Bet. A physicist who co-authored 
several books with Ehrlich and joined his 
wager against Simon, Holdren is now US 
President Barack Obama’s senior science 
adviser. How does Holdren’s boss bet on 
the future? “I can't predict what will hap- 
pen over the next 40 years,” Obama once 
replied when asked to wager his own bet. 
But, he continued, “I am — and always 
will be — full of hope about what the 
future holds.”


Jon Christensen is an adjunct
assistant professor in the Institute of the
Environment and Sustainability and the 
Department of History at the University 
of California, Los Angeles. 

e-mail: jonchristensen@ioes.ucla.edu 
Perpetual Disunity, a photo by Mark Kessell from his Perfect Specimens series.


Mark Kessell 


Medic-turned-artist Mark Kessell creates prints evoking evolution and human development
using the early photographic form of the daguerreotype. As his latest show opens in New York, he
talks about shooting portraits of primates, forceps, the nearly dead and the newly born.


What is Perfect Specimens?

It’s a photographic life cycle of Homo
sapiens. I tried to make images that
anyone could recognize as distinctly
human, showing conception, fetal
development, birth,
senescence and death. I want the photographs
to raise questions about what it means, from a
biological perspective, to be human.


Can you tell me about the fetal specimens? 

There is a bizarre little human skeleton from 
around 1890, with enlarged eye sockets and 
no brain. The way it is mounted in a bottle, 
grinning upwards and perched on a little 
spike, seems to mock the tragedy for baby 
and mother. There is also a perfectly nor- 
mal four-month-old fetus that is uncurled 
and standing upright and looks like a little 
alien. You can see the blood vessels under 
its translucent skin; the top of the skull
has not yet fused. In another bottle there
are tiny identical triplets, spontaneously 
aborted. These little guys once shared the 
same blood, but now they are forever alone. 


How do you capture birth and death? 

For birth, you find a brave woman with a 
generous partner, and as the baby comes out 
you stand between her legs with a camera. 
It’s unforgettable and inspiring, but also dis- 
turbing as that huge head emerges through 
such a small aperture. For death, I get per- 
mission from relatives to document the final 
moments of their loved one. One photo 
shows the lips of a woman who has spoken
her last words. Many avoid the subject, but 
death is a biological process, so why pretend? 


What part does your medical training play? 

I didn't take pictures until I was 39, when, on 
a road trip around Australia, my girlfriend 
handed me a disposable plastic camera. 
Within a year, I’d given up medicine and was
attending art school in Manhattan. At first, 
I imagined shooting remote landscapes for 
National Geographic, but then I thought,
‘Why fight my past?’ I thought back to
Egbert, the cadaver I dissected in medical
school. He was sliced lengthwise so you
could see the brain
inside the skull. How vulnerable this source
of our humanness seemed. And when we had
to dissect a human arm, I was awestruck by
the perfection of the hand, built with a sophistication
far beyond technological ingenuity.

Mark Kessell: Perfect Specimens
Last Rites Gallery, Manhattan, New York. 17 August-21 September.


Are you questioning whether we are unique? 
Most behaviours that were considered 
uniquely human until Darwin’s time exist 
in other species. Crows have a sense of 
play, form social alliances and consider the 
future in their behaviour. In my series Unmet 
Friends, I explore our capacity to understand
other creatures by showing primates,
birds and reptiles in poses that appear to 
express human emotions. In Specimen Box, 
a thousand photos of animal specimens are 
pinned to the walls, floor and ceiling of a 
small room. Everything in the room is dead 
except for the human observers, whose curi- 
ous faces are captured by hidden cameras. 


What other subjects have you covered? 

For Florilegium, I photographed surgical
instruments, such as facelift forceps and 
urethral dilators, to make them look alive. 
These are the most menacing of my images. 
(One was used as the poster for a 2005 horror 
movie, Hostel, which I am not brave enough 
to see.) I’m not sure I realized how disturbing 
they were while shooting. These instruments 
are designed to rend human flesh and alter 
the body that largely defines our identity. 


Does your work overturn assumptions? 

I want people to examine their assumptions 
the way a scientist does. So in every series 
there is something that isn’t what it seems. 
Many viewers perceive the instruments 
in Florilegium as botanical specimens. In 
Unmet Friends, animal faces may be miscon- 
strued as human and vice versa. In Perfect 
Specimens, it can be hard to tell dead bodies 
from living ones. I want people to ask, ‘Why 
is the artist showing me this?’ 


Why use the daguerreotype process? 

It’s an incredibly beautiful process, but very 
hard to control. The result depends on how 
the silver plate is polished, and on the inten- 
sity of the chemicals and light. Areas of over- 
exposure turn an exquisite shade of blue. As 
a doctor, I was taught to control and stand- 
ardize. But a daguerreotype can't be precisely 
replicated. Because it is a pure silver plate, 
it is also a mirror. You'll see your own face 
reflected. I'll see mine. ■


INTERVIEW BY JASCHA HOFFMAN 
Books in brief


What Makes a Hero? The Surprising Science of Selflessness
Elizabeth Svoboda CURRENT (2013)

Science writer Elizabeth Svoboda examines the super-altruism we
call heroism, drawing on current research to unravel its biological 
and environmental roots. Bioengineer Karl Deisseroth, for instance,
uses light-sensitive algal proteins to pinpoint the brain circuits 
that control nurturing impulses in mice. And psychologist Philip 
Zimbardo isolates efficient problem-solving in tight spots as central 
to heroic acts. Weaving in gripping case studies — such as Christoph 
von Toggenburg’s lifelong fund-raising for the vulnerable — 
Svoboda concludes that heroes are made, not born. 


How Things Shape the Mind: A Theory of Material Engagement 
Lambros Malafouris MIT PRESS (2013) 

Is the mind imprisoned in the brain? In this mix of neuroscience and 
philosophy, Lambros Malafouris suggests that mind and materiality 
are allied in ways that defy reductive world views. He argues that the 
act of making objects such as knapped tools in prehistory shaped 
neural processes; and furthermore, that this is a two-way street
in which “human intelligence ‘spreads out’ beyond the skin into
culture”. Engrossing, from his analysis of how the interplay of hands, 
neurons, clay, a moving wheel and social context result in a thrown 
pot, to the links between a blind man’s stick and brain plasticity. 


Future Bright: A Transforming Vision of Human Intelligence 
Michael E. Martinez OXFORD UNIVERSITY PRESS (2013) 

Intelligence can be learned, argues educationalist Michael Martinez. 
Noting that tackling planetary crises hangs, in part, on the focused 
work of many agile minds, Martinez shows how such a revolution is 
possible. Beginning with the deep structure of cognitive ability, such 
as ‘fluid’ intelligence — the capacity to adapt to the unfamiliar — he 
paces the landscape of intellect. His journey reveals tools for change, 
including the rich variety of intelligence, from social to creative; the 
brain’s extraordinary adaptability; and the cultivation of supportive 
behaviours such as the urge to excel. 


The Neanderthals Rediscovered: How Modern Science is 
Rewriting Their Story 

Dimitra Papagianni and Michael A. Morse THAMES & HUDSON (2013) 
Recent findings in geology, genetics and archaeology have radically 
changed our understanding of Neanderthals. In the first complete 
chronological narrative of the species from emergence to extinction 
(perhaps 250,000 to 25,000 years ago), archaeologist Dimitra 
Papagianni and science historian Michael Morse have shaped a 
gem. Our big-brained relatives buried their dead, cared for the 
disabled, hunted creatively and ate grains, wild herbs and even 
dolphins. A beautifully synthesized portrait of a powerful people. 


The Way of Science: Finding Truth and Meaning in a Scientific 
Worldview 

Dennis R. Trumble PROMETHEUS (2013) 

The popular perception of science hinges largely on medical and 
technological advances. Biomedical engineer Dennis Trumble calls 
for more: a widespread reconnection to science as a way of knowing 
ourselves and the world. Rationality and critical thinking, he asserts, 
are moral pathways. Quoting Charles Darwin’s “there is grandeur in 
this view of life”, Trumble aims — without histrionics — to urge the 
religious towards science as a source of meaning. Barbara Kiser 
HFEA responds on
germline therapy 


As chief executive of the
UK Human Fertilisation
and Embryology Authority
(HFEA), I take issue with 
Marcy Darnovsky’s criticisms 
of our public consultation on 
mitochondria replacement in 
reproductive therapy (Nature 
499, 127; 2013). 

Darnovsky queries the statistic 
used by the HFEA: “one in 200 
children is born each year with a 
form of mitochondrial disease”.
This comes from a study of 
mitochondrial mutations in the 
general population (H. R. Elliot 
et al. Am. J. Hum. Genet. 83, 254- 
260; 2008) and includes children 
with mild or no symptoms. Other 
UK organizations use the same 
figure, notably the National 
Health Service (see go.nature.com/dcocm2)
and the Nuffield Council on Bioethics
(see go.nature.com/s6z8gk).

Darnovsky also questions parts 
of the HFEA’s methodology and 
conclusions. The consultation was 
designed with external experts, 
and enabled public participation 
in various ‘strands’ that used 
qualitative and quantitative 
analysis. Our aim was to go 
beyond ‘yes’ or ‘no’ responses. In 
all but one of five strands, most 
people supported mitochondria- 
replacement techniques. Our 
publicly open questionnaire 
drew more opposition from its 
self-selected respondents, with 
some arguing that any in vitro 
manipulation of embryos is 
inappropriate or unethical. 

All the consultation material 
is available at our website
(www.hfea.gov.uk/7784.html). I suggest
that people judge for themselves. 
Peter Thompson HFEA, London. 
peter.thompson@hfea.gov.uk 


Ireland’s research 
output set to slide 


In the first global Nature 
Publishing Index (see
go.nature.com/vbqdbr), Ireland is ranked
20th in the Global Top 100 for
2012 and is identified as one of
five countries to watch, on the 
basis of a marked increase in 
the number of its publications 
in Nature Publishing Group 
research journals. Watchers 
could be disappointed when the 
next index is released. 

Ireland's research output 
is under threat. Research 
career frameworks are being 
formulated and implemented by 
several major Irish universities. 
These effectively limit 
researchers’ careers to six years 
through the enforcement of an 
‘up or out’ policy, whereby they 
must either secure increased 
funding to rise through the 
ranks, or leave the university. 

There is now a real risk that 
Ireland's talented researchers 
will be forced out as they reach 
the end of their allotted research 
time. They are left with the choice 
of going into industry, moving 
abroad to continue their research, 
or dropping out of science. 
Mark Jessopp, Ruairi O’Reilly, 
Gordon Dalton University 
College Cork, Ireland. 
m.jessopp@ucc.ie


New electricity grids 
must develop in sync 


Future US and European 
electricity systems will depend 
on the harmonious evolution of 
low- and medium-voltage ‘smart’ 
grids with high-voltage ‘super’ 
grids (see M. Amin Nature 499, 
145-147; 2013). Without care, 
these two developments could 
undermine, rather than reinforce, 
one another. 

Both smart and super grids 
involve three layers (see
go.nature.com/7flk7s): physical (power
flows), cyber (information 
technology) and socio-economic 
(stakeholders). There are 
important issues in the physical 
layer, but the greatest tensions 
will emerge in the socio- 
economic layer as self-interested 
stakeholders advance local or 
continental interests. 

Smart grids relate to local 
consumers becoming electricity 
producers themselves. This could
diminish the need for high- 
voltage transmission grids as local 
power starts to serve nearby users. 

Super grids will help to 
integrate massive renewable 
sources and to ensure reliability. 
Their development will be led by 
transmission companies at a time 
when those businesses must adapt 
to emerging smart grids. 

If smart and super approaches
are not aligned, the result will be 
a frailer electricity system. It is 
not just technologies that need to 
match up: policy regulations and 
market structures must do so too. 

Smartness should not just be an 
issue for better local grids, but for 
the electricity system as a whole. 
Ettore Bompard, Marcelo 
Masera European Commission 
Joint Research Centre, Institute for 
Energy and Transport, Petten, the 
Netherlands. 
William J. Nuttall The Open 
University, Milton Keynes, UK. 
william.nuttall@open.ac.uk 


Curb clearance for 
oil-palm plantations 


We question whether using 
accurate genotyping to enhance 
palm-oil yields will greatly reduce 
the acreage devoted to palm-oil 
plantations (R. Singh et al. Nature 
http://doi.org/ndc; 2013). 
Global demand for palm 
oil by the food industry and 
for biofuel is rising, pushing 
expansion of plantations into 
forests and peatlands. As higher 
yields generate bigger profits, 
more companies will venture 
into palm-oil production and 
marginal lands may be developed. 
Moves to increase palm- 
oil yields should therefore be 
combined with strict government 
controls on the expansion 
of existing plantations into 
environmentally sensitive areas. 
Some countries have imposed 
restrictions on new plantation 
licences, but with limited success. 
In Indonesia, for example, the 
recent severe widespread smoke 
haze was mainly associated 
with peatland clearance for oil
palms, and almost half of these
fires occurred in areas with a 
moratorium for new licences (see 
go.nature.com/doiwwf). 

Finn Danielsen Nordic Agency 
for Development and Ecology, 
Copenhagen, Denmark. 
fd@nordeco.dk 

Faizal Parish Global Environment 
Centre, Selangor, Malaysia. 


Seize diplomats 
smuggling ivory 


One route for the smuggling
of wildlife parts, such as rhino
horn and elephant ivory, could 
be closed without the need for 
new international conventions or 
treaties (B. I. Kaaria and
N. L. Muchiri Proc. 9th Int. Conf.
Environ. Compliance Enforc. 
204-208; 2011). Rogue diplomats 
must be stopped from exploiting 
their consular immunity to flout 
customs regulations and support 
this illicit trade. 

The scale of the problem is 
unquantifiable, but anecdotal 
evidence indicates that it is 
widespread. Diplomatic bags 
containing horn were reported 
to be evading customs checks 
as long ago as the 1980s (E. B. 
Martin Pachyderm 21, 28-34; 
1996). In 2011, a South Korean 
diplomat was caught smuggling 
ivory (see go.nature.com/fuo43s). 

Countries could adopt 
the UK practice of giving full 
immunity only to diplomats who 
are accredited to that country; 
others who pass through British 
territory are subject to regular 
checks. Because there are few 
direct flights between African 
and Asian capitals, diplomats 
would run the risk of also being 
checked at European airports 
when changing flights. 

Apprehending diplomats 
carrying illegal materials would 
generate negative publicity, 
forcing countries to punish 
errant civil servants and reduce 
the problem. Immunity should 
not mean impunity. 

Kelvin S.-H. Peh University of 
Southampton, UK. 
kelvin.peh@gmail.com 
Composite for smarter windows


Glass has been prepared that selectively absorbs visible and near-infrared light when an electrochemical voltage is applied. 
This opens the way to ‘smart’ windows that block heat on demand, with or without optical transparency. SEE LETTER P.323 


BRIAN A. KORGEL 


Residential and commercial buildings
account for about 40% of energy use
and about 30% of energy-related
carbon emissions in the United States. To
decrease this energy demand, materials are 
needed that help to regulate the heating and 
lighting requirements of buildings by respond- 
ing to environmental changes. In particular, 
electrochromic window materials, which 
change colour and/or transparency when 
subjected to an electric field, could significantly
reduce energy consumption in buildings.
On page 323 of this issue, Llordés et al.
report a great advance in the development of 
such materials. They have made a composite in 
which nanometre-scale crystals of indium tin 
oxide are embedded in a niobium oxide glass, 
with high control of nanocrystal loading and 
dispersion. The electrochromic performance 
of the composite is much better than expected 
from a simple sum of the optical absorption of 
its two separate components, because of both 
the nanostructure of the material and syner- 
gistic interactions that occur at the interface 
between the components. 

Inorganic nanocrystals are typically syn- 
thesized chemically with organic capping 
groups attached to aid the crystals’ dispers- 
ibility in solvents and to prevent aggregation 
or undesired particle growth. Unfortunately 
for many applications, the organic groups do 
not have useful electrical or optical properties. 
There has thus been much effort to replace 
the organic groups with inorganic groups that 
either add to the capabilities of the crystals or 
can be converted into an electrically or opti- 
cally active material. This approach has been 
used to make nanocrystal assemblies with 
greatly improved electrical properties and to
convert nanocrystals capped with inorganic 
complexes into a more useful photovoltaic 
material (a material that converts light into
electricity). 

Figure 1 | Electrochromic window design. Llordés et al. propose that their nanoparticle-in-glass
composite material could be used to make windows that controllably and selectively absorb visible
light and near-infrared light (heat). a, In the design, the window is an electrochemical cell in which two
conducting glass panes are separated by a solid electrolyte material. The authors’ material is deposited
on one pane, forming an electrode; a counter electrode is deposited on the other pane. In the absence of
an electrical load, the window is transparent to visible and near-infrared light. b, When an intermediate
voltage is applied, charge carriers (lithium ions, Li⁺, and electrons, e⁻) move through the circuit. The
nanoparticles in the composite become chemically reduced, whereupon they block most incoming
near-infrared light. c, At lower voltages, the glass matrix of the composite also becomes reduced and
blocks most incoming visible light.

Llordés et al. have used this strategy to create
their nanoparticle-in-glass materials. The
authors first stripped indium tin oxide (ITO)
nanocrystals of their organic caps and replaced
them with niobium-containing polyatomic
ions known as polyoxometalate (POM)
clusters. These clusters attach covalently to
the ITO surface to create a shell around the
nanocrystal. The researchers then condensed 
the modified nanocrystals into a film, simply 
by evaporating the solvent from a dispersion of 
the crystals. Finally, they converted the POM 
between the densely packed ITO nanocrystals 
into a niobium oxide (NbOₓ) glass matrix by
heating the film to 400°C. Compared with 
previously reported synthetic routes for mak- 
ing nanoparticle-in-glass materials, in which 
inorganic crystals are grown within a glass,
Llordés and co-workers’ method provides 
rigorous control over the nanocrystals’ size 
distribution and volume fraction. And, by 
adding more POM to the dispersion of POM- 
stabilized ITO nanocrystals, the authors could 
increase the volume fraction of the NbOₓ
glass matrix. 

One of the key features of the ITO nanocrystal-NbOₓ
glass material is that the glass
is covalently bonded to the nanocrystals. This 
restricts the molecular orientations available to 
the octahedral NbO₆ units found in the glass,
and leads to remarkable structural ordering 
that differs from that of pure NbOₓ. It turns
out that this ordering improves the electrochromic
properties of the glass matrix: NbOₓ
in the composite is five times darker than the 
bulk material when a similar voltage is applied. 

ITO nanocrystals are also electrochromic, 
but in a different wavelength region from 
NbOₓ: they undergo reversible electrochemical
redox reactions and absorb near-infrared
light in the reduced state, but are transparent
to this part of the spectrum when oxidized.
The combination of ITO nanocrystals with a
NbOₓ glass matrix therefore yields a material
in which both visible and near-infrared light 
absorption can be electrochemically modu- 
lated. This material could thus be used in 
smart windows, to control the amount of both 
heat (near-infrared) and light passing through 
them (Fig. 1). What’s more, the optical trans- 
parency can be tuned independently of the 
near-infrared transparency. 

Llordés and colleagues’ approach for making 
composite materials of inorganic nanocrystals 
in glass opens the way to a range of new 
material properties and applications, not just 
in electrochromics. The challenge for each 
application is to identify the best combinations 
of nanocrystal composition and modifiable 
inorganic capping groups. 

More specifically, several issues must still 
be addressed before the material can be used 
in windows. The authors used lithium metal 
as a counter electrode to test the performance 
of their material, but this is not acceptable for 
commercial applications because of safety 
concerns. A suitable counter electrode must 
be identified. Additionally, the researchers 
performed their photoelectrochemical tests 
using a liquid electrolyte as a charge-carrying 
material, whereas a solid electrolyte is probably 
more appropriate for buildings applications. 
The materials needed to build an electro- 
chromic window will be more expensive 
than conventional window materials, so the 
extra expense will need to be balanced by the 
energy and cost savings that can be achieved 
through their use. Ideally, no power input will 
be needed to maintain transparency or opacity, 
but this ability remains to be explored. 

Nevertheless, Llordés and co-workers’ 
results are promising. With appropriate coun- 
ter electrodes and a solid-state electrolyte, and 
if long-term stability of the composite can be
demonstrated, windows that have multispectral 
band transparency may be just around the cor- 
ner, potentially enabling buildings that offer 
unprecedented energy efficiency and comfort. ■
RNA exerts self-control


A crystal structure of two bound RNA molecules not only provides insight into 
how regulatory riboswitch sequences affect messenger RNA expression, but also 
expands our understanding of RNA structure and architecture. SEE LETTER P.363 


BHASKAR CHETNANI & 
ALFONSO MONDRAGON 


Over the past three decades, our knowledge
of the role of RNA in cellular
processes has expanded enormously.
For example, the structures of various regula- 
tory RNA sequences called riboswitches have 
revealed how they affect the transcription or 
translation of their downstream messenger 
RNA sequences through the recognition and 
binding of specific ligands. On page 363 of
this issue, Zhang and Ferré-D’Amaré* provide 
another structural insight — this time, into a 
bacterial riboswitch, called a T-box, in com- 
plex with a transfer RNA molecule. Their data 
elucidate how one RNA molecule recognizes 
another RNA of similar size and regulates its 
own transcription through a simple switching 
mechanism.

To support the process of protein syn- 
thesis, cells must regulate the pool of tRNAs 
that become charged with (covalently bound 
to) specific amino-acid residues and deliver 
them to the growing protein chain. Enzymes 
known as aminoacyl-tRNA synthetases carry 
out tRNA charging. In Gram-positive bacteria,
a T-box riboswitch located upstream
of the coding region of the mRNA of each
aminoacyl-tRNA synthetase negatively
regulates synthesis of the mRNA.

*This article and the paper under discussion were
published online on 28 July 2013.

The T-box RNA consists of at least two independently
folded domains: a sensory ‘aptamer’
domain that forms a long stem called stem I 
and that binds to specific tRNAs; and a second 
domain, which can switch between two alter- 
native conformations depending on whether 
the bound tRNA is charged or uncharged.
Whereas binding of a charged tRNA leads to 
termination of transcription of the aminoacyl- 
tRNA coding sequence (Fig. 1a), an uncharged 
tRNA stabilizes an antiterminator conforma- 
tion of the T-box, leading to transcription of 
the mRNA and subsequent protein synthesis 
(Fig. 1b). 

Stem I is a mostly double-helical RNA 
domain roughly 100 nucleotides long and is 
studded with several phylogenetically conserved
structural motifs along its length.
Previous work has implicated most of these 
motifs in tRNA recognition and binding,
yet atomic-level details of this process have 
remained largely unknown. Zhang and Ferré- 
D’Amaré provide the first glimpse into this 
mechanism by describing the crystal structure 
of a complex between stem I of the T-box of 
tRNA synthetase for the amino acid glycine 
and an uncharged glycyl-tRNA. The authors’ 
co-crystal structure also explains the precise 
role of the stem I motifs, delivers intriguing
information about their synergy and offers
a structural rationale for their sequence
conservation.

Figure 1 | RNA meets RNA. Zhang and Ferré-D’Amaré present the structure of the stem I domain
of a T-box riboswitch in complex with an uncharged tRNA. Stem I bends at the dinucleotide bulge
and the K-turn to recognize tRNA by binding to its anticodon loop and the T- and D-loops. a, The
amino acid present in a charged tRNA is thought to prevent interaction between the T-box sequence
motif downstream of stem I and the acceptor end of the tRNA, favouring formation of a terminator
loop that stops transcription of the mRNA downstream of the T-box. b, By contrast, the free acceptor
end of an uncharged tRNA interacts directly with the T-box sequence and leads to the formation of an
antitermination loop, allowing transcription of the mRNA and its subsequent translation into a protein.

Specific recognition of a tRNA by stem I 
seems to involve two main tRNA regions: 
the anticodon loop and the T- and D-loops. 
Specifically, stem I bends to follow closely the 
shape of the tRNA and contacts the anticodon 
loop and the T- and D-loops with its proximal 
and distal ends, respectively. 

The interaction between the tRNA T- and 
D-loops and the T-box stem I was anticipated, 
on the basis of recent bioinformatics, biochemical
and structural data. For example, a
previous crystal structure of a distal segment
of the same stem I showed two interleaved
loops forming a similar arrangement to the 
one observed in two other large and unrelated 
RNAs that also recognize tRNAs — the 
ribozyme RNase P and an RNA component of 
the ribosomes. In those large RNAs, the interleaved
loops participate in tRNA recognition
and binding by stacking their planar bases over 
evolutionarily conserved unstacked bases in 
the tRNA. 

Zhang and Ferré-D’Amaré’s structure of 
stem I and tRNA corroborates these predic- 
tions and demonstrates conclusively that the 
interleaved loops recognize the conserved 
tRNA ‘elbow’ in a similar fashion to that in the
previously observed cases. The structure also
reveals a surprise — the interactions between 
the three-nucleotide anticodon sequence of 
glycyl-tRNA and stem I are almost identical 
to those made between a tRNA anticodon and 
mRNAs on the ribosome.

The present paper further shows an elegant 
‘mutually induced fit’ mechanism for T-box 
riboswitches, by which both binding partners 
change conformation to attain shape com- 
plementarity. Finally, whereas the inherent 
flexibility of a tRNA is well established, the
co-crystal structure demonstrates how stem I 
bends sharply around two hinge regions to 
embrace the tRNA. The hinge regions com- 
prise a conserved dinucleotide bulge and 
K-turn motif, respectively (Fig. 1), and the 
structure explains why mutations in these 
hinges lead to impaired tRNA binding and 
regulation.

Zhang and Ferré-D’Amaré’s work is thus a 
big step towards understanding the mecha- 
nism of action of T-box riboswitches at the 
atomic level. It explains how a relatively small 
RNA molecule can recognize a specific tRNA 
by maximizing interactions. Details of the 
other important parts of the switching mech- 
anism — namely, recognition of the charged 
state of the tRNA and the conformational 
changes that lead to regulation — remain 
unknown, but this study brings us closer to 
understanding this fascinating riboswitch in 
even greater detail. 

An added bonus is that the authors’ struc- 
ture provides truly interesting information on 
RNA-RNA recognition and on RNA structure
and architecture. Despite the paucity of structures
of large RNA molecules, some commonalities
are starting to emerge. In all known
cases, recognition of tRNA involves at least 
two distinct and distant areas. RNA flexibility 
plays a notable part in recognition and serves 
to maximize interactions and achieve shape 
complementarity. In addition, the recurrent 
use of a few structural motifs seems to have 
a large role in RNA architecture, as illustrated 
by the presence of several previously known 
structural motifs along stem I. 

The structure of the T-box stem I-tRNA 
complex is one more example of the con- 
formational versatility of RNA that allows 
it to perform various functions. As we learn 
more about RNA structure and function, it 
frequently emerges that RNA molecules can 
execute many of the functions that are nor- 
mally ascribed to proteins. One can, therefore, 
imagine that there was a time when many of 
the functions that are now performed by pro- 
teins or by proteins in complex with nucleic 
acids were carried out solely by versatile RNA 
molecules. We look forward to more examples
of RNA molecules having unexpected roles in
biology. ■

GENE THERAPY

Primed for take-off


Gene therapy is finally getting a bumper crop of data that show clinical efficacy 
after fine-tuning of key parameters that control safety and potency. Supporting 
evidence comes from treatment of two life-threatening human diseases. 


PHILIPPE LEBOULCH 


The development of the field of gene
therapy shares many similarities with
the history of aviation. Each is based on
deceptively simple principles: the introduction 
of a therapeutic gene into cells and the flow of 
air over an aircraft's wing. However, the devil 
is in the details, and the details are in the num- 
bers. Each field was marred by shortcomings 
and adverse events early on. But in spite of 
naysayers lacking vision, both fields continued 
their quest, and there is now firm hope that 
gene therapy will soon do for medicine what 
aeroplanes did for transportation. Good rea- 
son for this optimism comes from the results of 
two studies published in Science by Biffi et al.
and Aiuti et al.

Both studies make use of lentiviral vector 
transduction, which continues to pave the way 
for gene therapy. The technique involves collecting
a type of stem cell from the body, adding
therapeutic genes using a delivery system (or 
vector) that has been safely engineered from 
a few components of HIV, and putting the 
modified cells back into the patient. The first 
international trials using lentiviral vectors for 
the gene therapy of inherited disorders were 
conducted in France for the diseases adrenoleukodystrophy
and β-thalassaemia, and
provided evidence of clinical efficacy in several 
patients for at least six years. The Science papers 
report the results of two more trials, one for
metachromatic leukodystrophy (MLD) and
another for Wiskott-Aldrich syndrome,
with further quantitative improvements in
efficacy. 

MLD is an inherited disorder caused by 
deficiency of the enzyme arylsulphatase A 
(ARSA). Patients with MLD accumulate 
a toxic metabolite in the nervous system, 
which results in massive cognitive and motor 
impairments. Children with the severe form, 
called MLD-LI, die within a few years of 
symptom onset, and current treatments are 
ineffective. 

Biffi et al. collected blood-making cells 
called haematopoietic stem cells (HSCs) 
from the bone marrow of three children with 
MLD-LI who were not yet showing symptoms, 
and exposed the cells to an ARSA-expressing 
lentiviral vector. These transduced cells were 
then re-injected into the children, who mean- 
while had received a brief course of chemo- 
therapy to eliminate uncorrected natural 
HSCs. This process is known as autologous
transplantation with myeloablative conditioning
(Fig. 1). The authors hypothesized
that blood cells now expressing the missing
enzyme at above-physiological levels (specific
enhancer and promoter sequences were
used to drive efficient expression of ARSA in
the lentiviral vector) would bring enough of
the missing enzyme to the nervous system,
directly or indirectly, to have therapeutic
benefit.

Figure 1 | Life-saving vectors. Researchers harvested haematopoietic stem cells (HSCs) from the
bone marrow of children with mutations in either the gene that causes metachromatic leukodystrophy
(MLD) or the gene that results in Wiskott-Aldrich syndrome (WAS). In the lab, they then exposed the
derived HSCs to highly purified and concentrated lentiviral vectors (LV) carrying the normal genes
in the presence of cytokine molecules to stimulate vector integration into HSC genomic DNA. Before
re-injecting the vector-carrying cells into the patients’ blood, the authors subjected the children to
myeloablative chemotherapy (and immunosuppression in the case of WAS) to destroy their remaining
defective HSCs and allow successful incorporation of the genetically corrected HSCs. Indicated are some
of the steps that resulted in improvement in gene-transfer efficiency and clinical benefit over previous
gene-therapy trials.
Figure labels: patient with MLD or WAS; HSC; highly purified, high-titre LV; HSCs with corrected gene;
cell release without cryopreservation; myeloablation (± immunosuppression adapted to specific disease);
intravenous infusion of fresh, transduced cells (high dose of cells).

The results were beyond expectations. 
Almost two years after the therapy, more than 
60% of blood cells contained the vector, with 
several vector copies per cell, resulting in 
ARSA levels ten times greater than in healthy 
individuals. These above-physiological levels 
are desirable for optimal potency and are con- 
sidered safe. The authors also detected ARSA 
activity in the cerebrospinal fluid, and there 
was no clinical progression of the disease in 
any of the three patients 7-21 months after the 
predicted age of symptom onset. 

Wiskott-Aldrich syndrome is also an inher- 
ited disorder, in which a gene defect causes 
immunodeficiency. Focusing on three chil- 
dren with the disorder, Aiuti and colleagues 
used a similar approach to that of Biffi et al., 
but chose a different conditioning regimen. 
Because corrected HSCs from patients with 
this syndrome are known to have a substantial
selective advantage in vivo, the authors used
less myeloablation but more immunosuppres-
sion to stimulate engraftment of the genetically
modified immunocompetent cells. 

Aiuti et al. also observed high levels 
(25-50%) of blood cells expressing the lenti- 
viral vector in all three patients. Although 
quantitative interpretation of this result is 
difficult because of the type of conditioning 
regimen used and the existence of substan- 
tial in vivo selection, this clinical response 
is impressive and better than that achieved 
in previous gene-therapy trials for Wiskott- 
Aldrich syndrome’. 

What made these striking improvements 
possible? First, lentiviral vectors continue to 
rule unsurpassed because of their many useful 
properties. These include effective transduc- 
tion of quiescent HSCs; compatibility with 
therapeutic gene constructs of large size and 
complexity; effective concentration and purifi- 
cation of vector particles to increase viral titres 
and potency; and amenability to cryopreserva- 
tion’ (Fig. 1). But as the authors suggest’”, the 
answer may just be a numbers game. 

Vector copy numbers in patients’ blood 
cells after engraftment in the MLD trial 
were around five times higher than those 
observed in trials for adrenoleukodystro-
phy’ and β-thalassaemia® (around 0.2 copies
of the vector after 6 years; my unpublished 
data). Moreover, the latest studies may have
benefited from a higher degree of vector purity
following its high-titre production. This 
would have minimized contamination from 
defective interfering vector particles. Another 
potentially important modification was the 
immediate reinfusion of non-cryopreserved 
cells, thus avoiding compromising HSCs 
before their engraftment. The very young age 
of the patients (7-16 months for MLD com-
pared with 18 years for β-thalassaemia*) and
the large dose of the infused cells may also have 
contributed positively. 

What about safety? The main concern with 
HSC gene therapy is the potential activation of 
cancer genes following random integration of 
vectors in the host genome, because efficient 
tethering of lentiviral vectors to genomic areas 
of lesser cancer risk is not currently feasible in 
these cells. Biffi and colleagues and Aiuti et al.
take comfort from the fact that their vectors 
showed a heterogeneous pattern of integra- 
tion into chromosomal sites, as observed after 
pan-genomic monitoring of blood cells. This 
is a presumptive indicator that no particular 
genomic areas carrying the vector may result 
in pre-malignant outgrowth of an HSC or 
another blood-cell clone. 

Yet the meaning of such analyses remains 
unclear’ and can either be falsely reassuring 
or alarming. For instance, the partially domi- 
nant cell clone bearing an integration site 
within the HMGA2 gene in a β-thalassaemia
patient raised anxiety among observers about 
the possibility of cancer development’. But six 
years after gene therapy, this clone is no longer 
dominant, although vector copy numbers in 
the patient’s cells and expression of the cor-
rected gene product, β-globin, remain suffi-
ciently high for the patient to be completely
independent of blood transfusions (my unpub- 
lished data). It is intriguing that HMGA2 has 
now been identified as a vector integration site 
in various studies, including the present trial 
for Wiskott-Aldrich syndrome’. 

The time is ripe for pharmaceutical compa- 
nies to grab the baton and bring gene thera- 
pies to patients in need. The naysayers now 
whisper that gene therapy is too expensive to 
treat patients, no matter where they are. But 
compared with the enormous lifelong cost of 
enzyme replacement or even taking small- 
molecule drugs, the thoughtful pricing of 
many gene-therapy approaches that require 
a one-off treatment may meet the respective 
requirements of patients, the pharmaceutical 
industry and institutional funders, even in 
developing countries.

Reliable teleportation


Two complementary experiments have demonstrated deterministic quantum 
teleportation of quantum bits. The results could find applications in quantum 
communications and computing. SEE LETTERS P.315 & P.319


TIMOTHY C. RALPH 


The ability to teleport quantum states
from one place to another by means
of classical communication systems is
both counter-intuitive and potentially useful.
However, experimental demonstrations of 
such quantum teleportation have tended to 
be hit-and-miss, with few successful attempts. 
Two studies in this issue now report determin- 
istic quantum teleportation: one involves the 
teleportation of the quantum state of a single 
photon, by Takeda et al.' (page 315); the other 
is teleportation of the quantum state of a super- 
conducting circuit, by Steffen et al.” (page 319). 
Quantum teleportation’ relies on the abil- 
ity to produce and measure entangled states. 
Entanglement refers to strong correlations 
between distinct quantum systems that defy 
classical explanation. Generically, teleporta- 
tion protocols proceed by three steps (Fig. 1): 
a pair of quantum systems in an entangled state 
is produced and distributed, one to a sender 
(Alice) and the other to a receiver (Bob); Alice 
makes a joint measurement of her member of
the entangled pair and the unknown state she
wishes to teleport, and sends the measurement 
result to Bob; Bob uses the measurement result 
he receives from Alice to manipulate his quan- 
tum system in a predetermined way. After this 
manipulation, Bob’s quantum system ends up
being in the unknown state, which is thereby teleported
from Alice to Bob, with the only direct com- 
munication being a classical message — Alice's 
measurement result. 
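
The three steps above can be sketched for the textbook single-qubit case. This is a minimal illustration under stated assumptions, not the continuous-variable or superconducting-circuit implementation of either experiment: the joint measurement is modelled as a CNOT gate followed by a Hadamard gate and a readout of Alice's two qubits, and the function names `teleport` and `fidelity` are hypothetical.

```python
import math
from itertools import product

SQRT_HALF = 1 / math.sqrt(2)


def teleport(psi, m0, m1):
    """Return (probability, Bob's corrected state) for Alice's outcome (m0, m1).

    psi is the unknown qubit (a, b), assumed normalised. Qubit 0 is the
    unknown state, qubit 1 is Alice's half of the entangled pair and
    qubit 2 is Bob's half.
    """
    # Step 1: the shared entangled pair (|00> + |11>)/sqrt(2) on qubits 1 and 2.
    amp = {}
    for q0, q1, q2 in product((0, 1), repeat=3):
        amp[(q0, q1, q2)] = complex(psi[q0]) * (SQRT_HALF if q1 == q2 else 0.0)

    # Step 2: Alice's joint measurement, modelled as CNOT (qubit 0 controls
    # qubit 1), then a Hadamard on qubit 0, then reading out qubits 0 and 1.
    amp = {(q0, q1 ^ q0, q2): v for (q0, q1, q2), v in amp.items()}
    rotated = {}
    for q1, q2 in product((0, 1), repeat=2):
        x0, x1 = amp[(0, q1, q2)], amp[(1, q1, q2)]
        rotated[(0, q1, q2)] = SQRT_HALF * (x0 + x1)
        rotated[(1, q1, q2)] = SQRT_HALF * (x0 - x1)

    # Bob's (unnormalised) state, conditioned on Alice's outcome.
    bob = [rotated[(m0, m1, 0)], rotated[(m0, m1, 1)]]
    probability = sum(abs(x) ** 2 for x in bob)
    norm = math.sqrt(probability)
    bob = [x / norm for x in bob]

    # Step 3: Bob's predetermined correction, chosen from the two classical bits.
    if m1:  # bit flip (Pauli X)
        bob = [bob[1], bob[0]]
    if m0:  # phase flip (Pauli Z)
        bob = [bob[0], -bob[1]]
    return probability, bob


def fidelity(u, v):
    """Squared overlap |<u|v>|^2 between two normalised qubit states."""
    overlap = sum(complex(a).conjugate() * complex(b) for a, b in zip(u, v))
    return abs(overlap) ** 2


if __name__ == "__main__":
    unknown = (0.6, 0.8j)  # arbitrary example state, chosen for illustration
    for m0, m1 in product((0, 1), repeat=2):
        p, bob_state = teleport(unknown, m0, m1)
        print((m0, m1), round(p, 3), round(fidelity(unknown, bob_state), 6))
```

In this idealised sketch every measurement outcome lets Bob recover the state, which is what 'deterministic' means here; only two classical bits travel from Alice to Bob.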

Most previous quantum teleportation 
experiments have been non-deterministic 
because some steps of the protocol, in par- 
ticular producing the initial entanglement 
and carrying out the joint measurement, have 
been difficult to perform efficiently. For exam- 
ple, in single-photon quantum optics, which 
most earlier teleportation studies have used, 
the following restrictions apply: first, entan- 
gled pairs are produced by a spontaneous pro- 
cess that typically succeeds less than once in 
100 attempts; second, with linear optics, the 
required joint measurement can be performed 
with a probability of success of only 50%. In 
their studies, Takeda et al. and Steffen et al.
take very different approaches to overcoming
this problem of limited efficiency.

Figure 1 | A quantum teleportation protocol. A qubit, here either a single photon’ in a superposition of
two pulses or a superconducting circuit’, is prepared in an unknown state. A pair of quantum systems in
an entangled state has previously been prepared and distributed between a sender (Alice) and a receiver
(Bob). Alice makes a joint measurement between the unknown state, which she wishes to teleport, and
her member of the entangled pair. The measurement result is sent to Bob via a classical channel. He uses it
to manipulate his member of the entangled pair, in the process recreating the unknown state at his station.
The quantum link that allows the entangled pair to be distributed to Alice and Bob is optical in the case of
the photon and uses microwave photons in the superconducting case.
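
To put the two single-photon restrictions quoted above into numbers, the short calculation below multiplies the success probabilities of the two steps. It is a rough sketch that assumes the steps succeed independently, and it uses 1 in 100 as an upper bound for pair production because the text says only 'less than once in 100 attempts'; the function name is hypothetical.

```python
def overall_success_probability(p_pair, p_joint_measurement):
    """Probability that one attempt completes both steps, assuming independence."""
    return p_pair * p_joint_measurement


# Upper bound for pair production and the linear-optics measurement limit,
# both taken from the text above.
P_PAIR_UPPER_BOUND = 1 / 100
P_JOINT_MEASUREMENT = 0.5

p_success = overall_success_probability(P_PAIR_UPPER_BOUND, P_JOINT_MEASUREMENT)
expected_attempts = 1 / p_success  # mean number of attempts per success

if __name__ == "__main__":
    print(f"success probability per attempt: at most {p_success:.3%}")
    print(f"expected attempts per success: at least {expected_attempts:.0f}")
```

Under these assumptions an attempt succeeds at most about once in 200 tries, which is why such experiments are described as non-deterministic.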

Takeda and colleagues’ experiment tele- 
ports a single-photon quantum bit (qubit) 
with much higher efficiency than all earlier 
optical studies. A qubit is a quantum system 
that has two distinct outcomes — in this case, 
a single photon in a superposition of two dif- 
ferent arrival times at a detector. The trick the 
authors use is to perform a more general form 
of teleportation that works for any optical-field 
state, rather than only for single-photon qubit 
states. This sounds, and is, hard to achieve, but 
it turns out to make a big difference because 
deterministic sources of optical-field entangle- 
ment and efficient joint field measurements 
are both available. This ‘continuous-variable’ 
teleportation protocol’ has previously been 
demonstrated for multi-photon states™®, but 
this is the first time that a single-photon qubit 
has been teleported by this method. The result 
is the deterministic teleportation of a single- 
photon qubit with a quality that exceeds the 
limits set by any classical protocol — that is, 
one that does not use entanglement but in 
which Alice instead tries to directly measure 
(necessarily imperfectly) the state of the qubit 
and sends this information to Bob. 

Steffen and colleagues tackled the efficiency 
problem in a different way. They implemented 
teleportation of a solid-state qubit — a super- 
conducting circuit. The qubit circuit is on the 
100-micrometre scale and is held at tempera- 
tures of around 20 millikelvins. Qubits are 
formed from small currents running in the 
circuit’, and at these sizes and temperatures, 
the currents behave quantum mechanically. 
As in Takeda and colleagues’ study, photons 
are still used to move information around, but 
these are now at microwave (rather than opti- 
cal) frequencies®. Alice and Bob are separated 
by about 5 millimetres. Strong interactions 
between the superconducting qubits and the 
microwave photons allow entanglement to be 
produced deterministically between the super- 
conducting qubits held by Alice and Bob. Such 
interactions can also be used to make a joint 
measurement deterministically between Alice's 
qubit and the unknown qubit, as required by 
the teleportation protocol. Again, the result 
is deterministic teleportation with a quality 
that exceeds the classical limit. Deterministic 
teleportation has been achieved previously in 
ion-trap experiments, but in that situation the 
distances between the qubits were 1,000 times 
smaller®””. 

Both experiments are ‘hero’ experiments, 
in the sense that they only just exceed classi- 
cal limits when they are running in the fully 
deterministic mode. In Takeda and colleagues’ 
study, this is mainly due to the limited strength 
of the field entanglement, whereas for Steffen 
and co-workers’ study the main problem is 
imperfect differentiation of the outcomes of 
the joint-measurement results. However, both 
groups are also able to run their experiments in
a non-deterministic mode. In Takeda and col-
leagues’ experiment, the photon qubit can be
successfully teleported about 40% of the time 
(compared with a value of much less than 1% 
for all previous optical studies), and repro- 
duced at Bob with about 88% quality (similar 
to the best achieved in previous experiments). 
Steffen et al. can arrange for their qubit to be 
teleported 25% of the time with a quality of 
about 82%. For both teams, the limitations in 
their experiments are clearly understood and 
shown not to be fundamental impediments to 
future improvements. 

The advances made in these experiments 
should flow on to allow improved quantum- 
information protocols. Takeda and co-workers’ 
optical ‘flying qubits’ have potential applica- 
tions in quantum communications. But there is 
a caveat. Improving quantum communications 
using teleportation requires the purification of 
entangled states sent through a noisy channel. 
Teleportation could therefore be used to trans- 
fer quantum states between distant locations 
with better quality than sending them directly. 
Purification can be achieved by distillation 
techniques. However, distillation techniques 
for field entanglement are not as advanced as 
those for qubit entanglement. Nevertheless, 
promising advances in distilling field entan- 
glement have been made”. By contrast, the 
solid-state ‘standing qubits’ in Steffen and 
colleagues’ experiment are more likely to find 
applications in quantum computing”. Notably, 
they demonstrate the increasing sophistication 
and quality of the manipulations possible with 
superconducting qubits coupled to microwave 
transmission lines, and raise the profile of such 
qubits as potential building blocks for large- 
scale quantum computation. 

More progress is needed before determinis- 
tic quantum teleportation under practical con- 
ditions, and with quality approaching 100%, 
becomes a reality. But these experiments rep-
resent significant steps along that path.

Metabolite turns
master regulator 


The phenomenon of catabolite repression enables microorganisms to use their 
favourite carbon source first. New work reveals α-ketoacids as key effectors of
this process, with their levels regulating gene expression. SEE ARTICLE P.301 


JOSHUA D. RABINOWITZ & 
THOMAS J. SILHAVY 


Nutrients in the environment are a
primary determinant of microbial
physiology. When preferred nutrients
are abundant, microbes grow fast. When they 
are scarce, growth slows down. This change 
in growth rate is accompanied by a change 
in cellular composition, with fast-growing 
cells being loaded with ribosomes (which are 
needed for rapid protein production), and 
slower-growing cells containing more meta- 
bolic enzymes for nutrient assimilation (catab- 
olism)’. In this issue, You et al. (page 301)° 
identify a striking linear relationship between 
the total protein composition (the proteome) 
of a cell and its growth rate, which extends 
beyond ribosomes to metabolic enzymes”. 
They further demonstrate how such a relation- 
ship can arise, in part, from a new regulatory 
connection, in which a particular class of car-
bon catabolite called α-ketoacids, which form
the carbon skeletons of amino acids, serves as
a master transcriptional regulator by inhibiting
the production of cyclic AMP — the primary
inducer of carbon-catabolic genes.

*This article and the paper under discussion® were
published online on 7 August 2013.

Perhaps the most intensively studied exam- 
ple of gene regulation involves the enzymes of 
the bacterium Escherichia coli that mediate lac- 
tose catabolism. These enzymes are expressed 
only when lactose is present and glucose (the 
preferred carbon source) is not*. Escherichia 
coli detects the presence of lactose through 
binding of this sugar to, and inactivation of, the 
lac repressor protein’, and it senses the absence 
of glucose from elevated levels of cAMP*, which 
binds to and activates the transcription factor 
Crp (refs 7, 8). Identification of this classic reg- 
ulatory loop involved seminal contributions 
from three Nobel laureates — Francois Jacob 
and Jacques Monod, who won the 1965 physi- 
ology prize for their pioneering studies of gene 
regulation” , and Earl Sutherland, winner of the 
1971 prize for discoveries related to cAMP, who 
later identified cAMP in E. coli and showed that 
its levels rapidly fall in response to glucose’®. 

Figure 1 | Regulation of cAMP levels by carbon and nitrogen availability. The main biosynthetic task
of the bacterium Escherichia coli is protein production. This requires energy generation (grey arrow),
carbon-skeleton synthesis, nitrogen incorporation to make amino acids, and protein synthesis. The
anabolic fluxes (red, green and blue arrows) increase linearly with growth rate, as does the required
anabolic proteome fraction except for processes that are directly slowed by nutrient limitation (such
as amino-acid synthesis during nitrogen limitation). Carbon catabolism provides energy and building
blocks for anabolism, and cAMP allocates the proper fraction of the proteome to carbon-catabolic
enzymes. You et al.’ find that this is achieved by a new regulatory loop, wherein α-ketoacids inhibit
cAMP production: when favoured carbon sources such as glucose are present or nitrogen is limiting,
carbon influx exceeds anabolic capability and α-ketoacid accumulation inhibits cAMP. Conversely, when
favoured carbon sources are depleted, α-ketoacid levels fall, and cAMP increases to stimulate production
of the required carbon-catabolic machinery (orange arrows).

What controls the activity of adenylate
cyclase (the enzyme that makes cAMP) and
so cAMP levels? The phosphorylated form 
of the enzyme EIIAGlc can activate adenylate
cyclase. EIIAGlc is a component of the bac-
terial phosphotransferase system, which takes
the phosphate group from phosphoenolpyru- 
vate — the last intermediate in the biochemical 
process of glycolysis — and passes it through 
a series of enzymes, eventually leading to 
glucose import, phosphorylation and metabo- 
lism. When glucose is absent, the phosphorelay 
activity of EIIAGlc ceases, and the phosphoryl-
ated enzyme induces cAMP production’.

Although elegant, this is not the full story. 
Carbon sources that are not imported by way 
of the phosphotransferase system also tend to 
counteract cAMP production. In 1961, Boris 
Magasanik hypothesized that the general abil- 
ity of carbon sources to repress the expression 
of catabolic enzymes reflects the fact that all 
carbon sources converge to produce a key sig- 
nalling metabolite — a process he termed cat- 
abolite repression”. The relevant catabolites, 
however, were never identified. 

You et al. now show that the key catabo-
lites are α-ketoacids, which inhibit adenylate
cyclase independently of the phosphotrans-
ferase system. Although several α-ketoacids
can inhibit adenylate cyclase, α-ketoglutarate
is the most abundant" and therefore likely to
be physiologically dominant. As both an inter-
mediate of the energy-producing tricarboxylic
acid cycle and the carbonaceous substrate for
nitrogen assimilation, α-ketoglutarate reflects
the balance of carbon to nitrogen in available
nutrients. Inhibition of adenylate cyclase by
α-ketoglutarate explains the long-standing
observations” “* that low nitrogen availability
blocks expression of what we now know are
cAMP-induced genes. Fascinatingly, the role
of α-ketoglutarate as both a metabolite and a
master regulator is evolutionarily conserved; 
in humans it serves as a cofactor to enzymes 
that covalently modify transcription factors, 
histone proteins and DNA. 

How and why does this regulatory con- 
nection lead to linear relationships between 
gene expression and cellular growth rate in 
microbes? When carbon limitation slows cel- 
lular growth, there is an increasing need for 
carbon-catabolic enzymes and a decreasing 
need for anabolic enzymes, which promote 
biosynthesis and use carbon as a building 
block (Fig. 1). Quantitatively, in the absence 
of futile cycling (in which two metabolic 
pathways operating in opposite directions 
cancel out each other’s effects, wastefully 
using up energy), the cell’s anabolic metabo- 
lism is directly proportional to growth rate. 
So, as growth slows, the required amount of 
anabolic enzymes, assuming their activities 
are constant, decreases linearly. Accordingly, 
the resulting ‘empty space’ in the proteome — 
which will be filled with enzymes required 
for coping with the carbon limitation, such as 
cAMP-regulated gene products — increases
linearly with decreasing cellular growth.

When growth slows because of nitrogen 
limitation, there is less need for carbon- 
catabolic enzymes and more demand for those 
involved in nitrogen assimilation. This time, 
the decrease in the requirement for carbon- 
catabolic flux is linear with decreasing growth 
rate. Thus, the situation is flipped, but the 
optimal responses still remain linear. 

Inhibition of cAMP production by 
α-ketoglutarate naturally produces the
desired responses. Whenever carbon-catabolic
machinery is in excess relative to anabolic
machinery, α-ketoglutarate accumulates,
cAMP levels fall and carbon-catabolic
enzymes are repressed. Conversely, when ana-
bolic machinery is in excess, α-ketoglutarate
is depleted, cAMP levels rise and carbon- 
catabolic enzymes increase. The steady-state 
concentration of cAMP is therefore the factor 
that ensures that the proper amount of the pro- 
teome is devoted to carbon-catabolic enzymes. 
Consequently, the physiological function of 
cAMP signalling goes beyond simply ena- 
bling hierarchical utilization of carbon sources, 
which is of unclear significance for fitness and 
can also be achieved through other mecha- 
nisms (including inducer exclusion’’). Instead, 
cAMP controls the fraction of the proteome 
devoted to carbon catabolism. 
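
The feedback logic described above can be illustrated with a toy model. This is a hedged sketch, not the quantitative model of You et al.: every parameter (`carbon_quality`, `k_anabolic`, `phi_total`, `gain`) is a hypothetical stand-in, α-ketoacid excess is approximated as carbon influx minus anabolic capacity, and cAMP is folded into a single update rule that lowers the catabolic proteome fraction whenever that excess is positive.

```python
def steady_state_allocation(carbon_quality, k_anabolic=1.0, phi_total=0.5,
                            gain=0.1, steps=500):
    """Iterate a toy alpha-ketoacid/cAMP feedback loop to its steady state.

    Returns (catabolic_fraction, growth_rate). All parameters are
    illustrative assumptions and carry arbitrary units.
    """
    catabolic = phi_total / 2  # initial guess for the catabolic proteome fraction
    for _ in range(steps):
        carbon_influx = carbon_quality * catabolic
        anabolic_capacity = k_anabolic * (phi_total - catabolic)
        # Influx above anabolic capacity stands in for alpha-ketoacid build-up.
        ketoacid_excess = carbon_influx - anabolic_capacity
        # Build-up inhibits cAMP, which reduces catabolic enzyme expression;
        # depletion has the opposite effect.
        catabolic -= gain * ketoacid_excess
        catabolic = min(max(catabolic, 0.0), phi_total)
    growth_rate = carbon_quality * catabolic
    return catabolic, growth_rate


if __name__ == "__main__":
    for quality in (0.25, 0.5, 1.0, 2.0):
        fraction, growth = steady_state_allocation(quality)
        print(f"carbon quality {quality}: catabolic fraction {fraction:.3f}, "
              f"growth rate {growth:.3f}")
```

In this toy, poorer carbon sources settle at a larger catabolic fraction, and at steady state the catabolic fraction equals `phi_total - growth_rate / k_anabolic`, a straight line in growth rate. That linearity follows from the simplifying assumptions made here and is offered only as intuition for the relationship discussed in the text.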

In a broader historical context, during the
half-century between the coining of the term 
catabolite repression and the present work, 
physiology has largely taken a back seat to 
molecular genetics and, more recently, to 
genomics. Therefore, You and colleagues’ use 
of quantitative physiology to elucidate the 
molecular mechanism of catabolite repression 
is particularly noteworthy. One hopes that it is
the beginning of a rebalancing, in which physi-
ology-driven systems biology emerges as a full
equal to research driven by molecular biology.
As the present paper shows, the strength of the
physiology-driven approach is not in finding
the full scope of molecular events occurring in
biological systems, but in identifying the most
functionally important ones.

MICROBIOLOGY

A weapon for
bacterial warfare 


The finding that bacteria use a sharp spike to deliver toxins into competing 
microorganisms, and that this mechanism co-evolved with a bacteriophage 
structure, presents a new vision of bacterial secretion systems. SEE LETTER P.350 


ALAIN FILLOUX 


Bacteria have evolved molecular machines
to secrete toxins and proteins into their
environment or into target cells. The
distinctive features of these machines led 
microbiologists to classify them by type; the 
type VI secretion system’, for example, has 
the unusual function of injecting toxins into 
bacterial prey, causing death’. This system 
also shows unprecedented similarity to the
puncturing devices and injection machines
of bacteriophages, viruses that infect bac- 
teria’. Recognition of this remarkable like- 
ness has led to notable discoveries, and that 
reported by Shneider et al.* on page 350 of 
this issue might be viewed as the cherry on 
top*. The researchers have solved the struc- 
ture of proteins belonging to the PAAR family 
and show how these proteins function in the
type VI secretion system to deliver proteins to
target cells — a breakthrough for our under-
standing of the assembly and evolution of
this machine.

*This article and the paper under discussion’ were
published online on 7 August 2013.

Figure 1 | Structures of the T4 bacteriophage and the bacterial type VI secretion system. Shneider
et al." have established that small proteins belonging to the PAAR family sit on top of surprisingly
similar protein structures that are found in the tail of T4 bacteriophage viruses (a) and the type VI
secretion system (T6SS) of some bacteria (b). Both complexes contain a tube of hexameric protein rings
(comprising gp19 in the bacteriophage and Hcp in T6SS) and a ‘spike’, made up of the VgrG protein in
the T6SS and the gp27₃–gp5₃ protein complex in the T4 bacteriophage. Both tubes are surrounded by a
sheath structure that contracts to push the spike through bacterial cell membranes; bacteriophages use
this system to inject DNA into a cell they are infecting, whereas bacteria use the T6SS to inject proteins or
toxins into other cells. Shneider and colleagues suggest that PAAR proteins associated with the T6SS act
as adaptors that transport these proteins or toxins. The hinge (or a flexible domain) possibly connects the
toxin to the PAAR.

To follow this story, we need to appreci- 
ate how studies of bacteriophages have aided 
investigations of the action of the type VI 
secretion system (T6SS). Early database 
searches for genes with sequence similarity to 
those encoding T6SS proteins failed to pro- 
vide leads as to the function of these genes, 
but insight came from the three-dimensional 
structure of two T6SS proteins, Hcp and VgrG. 
Hcp was shown to form hexameric rings that 
pile on top of each other to make a tube’, and 
docking experiments’ showed structural simi- 
larities between Hcp and gp19, the protein that 
forms a tube in the tail of the T4 bacteriophage 
(Fig. 1). Furthermore, the structure of VgrG*” 
bears a striking resemblance to the gp27₃–gp5₃
protein complex in this bacteriophage’. This 
complex forms a needle-like structure ending 
in a rigid β-stranded helix; the needle func-
tions as a puncturing device that sits on top
of the gp19 tube. A contractile sheath struc- 
ture pushes the tube and puncturing device 
through the envelope of bacterial cells, inject- 
ing the bacteriophage DNA into these target 
cells®. Electron microscopy studies’* have 
revealed that two proteins of T6SS form a 
similar sheath-like structure. 

In possessing a sheath, a tube and a 
puncturing device, there is no doubt that the 
T6SS resembles the tail of a bacteriophage, 
although its directionality is inverted — it
pushes from inside the cell. Despite the fact
that the systems clearly have different roles, 
with T6SS injecting proteins or toxins” and 
the bacteriophage tail injecting DNA, Shnei- 
der et al. sought to investigate just how far the 
structural similarities between the two go. The 
authors noted, from a reconstruction’ of the 
bacteriophage T4 tail made using cryo-elec- 
tron microscopy, that a small but unidenti- 
fied protein is located at the tip of the phage 
spike’. By analysing bacteriophage genomes 
and T6SS gene clusters, they found that both 
contain genes encoding small proteins of 
the PAAR family. They then used structural 
approaches’ to show that these proteins have 
a conical shape that fits perfectly on top of 
the gp27₃–gp5₃ (bacteriophage) and VgrG
(T6SS) spikes. Moreover, the conical shape of 
the PAAR proteins makes the spikes sharper, 
emphasizing their role in puncturing bacterial 
cell membranes. 

Most bacterial secretion systems are highly 
specific, such that a system from one bacter- 
ial species will not necessarily recognize and 
transport proteins from another, even if both 
species use a similar secretion system. Shneider 
and colleagues show that the T6SS is no 
exception, that PAAR proteins are essential 
for T6SS function, and that, although there 
is some flexibility, they cannot systematically 
be exchanged between T6SS systems of differ- 
ent origin. This led the authors to hypothesize 
that PAAR proteins could function as adaptors 
between the VgrG spike and the proteins or
toxins to be transported (Fig. 1b).

A bioinformatics analysis showed that
PAAR proteins are modular and that some 
have a carboxy-terminal extension with 
predicted enzymatic or toxin functions. This 
suggests that a PAAR protein sitting on the 
VgrG spike not only contributes to the pierc- 
ing of the target cell, but also transports the 
toxin, which is bound to it. However, it is worth 
noting that some VgrG spikes already have 
functional C-terminal extensions — such as 
the actin crosslinking domain of the bacterium 
Vibrio cholerae’ — and may not need a PAAR 
adaptor. 

This last part of Shneider and colleagues’ 
study is more speculative and relies essentially 
on hypothesis. Indeed, the authors’ interpreta- 
tions could cast doubt on the role of PAAR as 
a sharp spike, if this part is hidden by a toxin 
or enzyme. Further structural evidence is 
required to assess whether adding a toxin or 
protein at its C terminus hinders the PAAR 
protein’s puncturing activity. If it is shown not
to, this will definitively reveal the T6SS to be a 
flexible and modular toxin-delivery device to 
which other domains could be added, multi- 
plying the possible functions of the system. 
The authors’ suggestion of such a mechanism 
means that the hunt for a broader panel of 
adaptors and toxins associated with the T6SS 
is now open. 

The molecular details of this fascinating 
mechanism might distract from the ques- 
tion of why the T6SS has attracted so much 
attention. The T6SS is known as a system for 
killing other bacteria or subverting cells*”® — 
a remarkable tool that can be used to build 
an optimal bacterial ecosystem, to colonize 
a host, or both, and which therefore has clear 
relevance to medicine and ecology. A secre- 
tion system per se has no function other than 
to transport proteins; the secreted proteins are 
the ‘software’ of the machines, and it is their 
activity that determines the physiological con- 
sequences of the secretion system. As such, 
the discovery of new T6SS effectors is likely to
herald major breakthroughs.

Climate extremes and the carbon cycle


Markus Reichstein, Michael Bahn, Philippe Ciais, Dorothea Frank, Miguel D. Mahecha, Sonia I. Seneviratne,
Jakob Zscheischler, Christian Beer, Nina Buchmann, David C. Frank, Dario Papale, Anja Rammig, Pete Smith,
Kirsten Thonicke, Marijn van der Velde, Sara Vicca, Ariane Walz & Martin Wattenbach


The terrestrial biosphere is a key component of the global carbon cycle and its carbon balance is strongly influenced by 
climate. Continuing environmental changes are thought to increase global terrestrial carbon uptake. But evidence is 
mounting that climate extremes such as droughts or storms can lead to a decrease in regional ecosystem carbon stocks 
and therefore have the potential to negate an expected increase in terrestrial carbon uptake. Here we explore the 
mechanisms and impacts of climate extremes on the terrestrial carbon cycle, and propose a pathway to improve our 
understanding of present and future impacts of climate extremes on the terrestrial carbon budget. 


For the past five decades terrestrial ecosystems have been absorbing
25-30% of anthropogenic carbon dioxide (CO₂) emissions’,
with much of this uptake occurring via carbon accumulation in
forest biomass and soils’. Proposed mechanisms for this net carbon sink
due to enhanced vegetation growth include CO₂ and nitrogen ferti-
lization, and gradually increasing growing season length in northern
regions. Overall, this terrestrial sink mitigates the anthropogenic increase
of atmospheric CO₂ levels, and provides a negative feedback in the climate/
carbon-cycle system’. It is essential to investigate to what extent, for how
long and in which ecosystems this net CO₂ absorption and negative feed-
back will continue.

To address these questions, coupled carbon-climate model experiments
have been set up’*, the most recent being the Coupled Model Intercompa-
rison Project (CMIP5), whose results are used in the current Intergo-
vernmental Panel on Climate Change (IPCC) assessment. In the CMIP5
comparison, state-of-the-art Earth system models incorporate a mecha-
nistic description of the carbon cycle coupled with climate. In all of these
models, the combined effect of CO₂, climate change and (in fewer models)
nitrogen deposition leads to increased vegetation productivity, fostering
enhanced carbon sinks in temperate and boreal regions and the above-
mentioned negative feedback on climate change. Future projections of
ecosystem responses and thus feedback strength are, however, highly
uncertain*“. Recent studies indicate that the occurrence of extreme events,
for instance heatwaves, droughts or storms, and the associated disturban-
ces can partially offset carbon sinks or even cause net losses in carbon sto-
cks, thereby releasing CO₂ to the atmosphere**. Because extreme events
can trigger immediate and time-lagged responses of ecosystems, such as
mortality, fires or insect infestations””®, their effects on carbon fluxes and
stocks are nonlinear. Thus, even a small shift in the frequency or severity
of climate extremes could substantially reduce carbon sinks and may result
in sizeable positive feedbacks to climate warming.

In this Perspective, we investigate the diverse impacts of climate extremes
on the carbon cycle of terrestrial ecosystems. We start with a conceptual
treatment of climate extremes from an impact point of view, analyse key
ecosystem mechanisms triggered by climate extremes and make a first
attempt to estimate the susceptibility of the carbon cycle in different ecosystem
types. Finally, based on a set of metrics on the magnitude of extremes
calculated from multi-temporal Earth observation data and Earth system 
model simulations, we provide a first estimate of the relative magnitude of 
carbon-cycle deviations caused by climate extremes over the past 30 years. 
We propose that climate extremes have the potential to significantly 
affect the carbon cycle regionally and globally. To obtain reliable estimates
of the sign and magnitude of future carbon-cycle feedbacks, we need a better
understanding and description of both the occurrence of climate extremes
themselves and the ecosystem carbon-cycle processes that they trigger.
To this end, we advocate
a new generation of ecosystem manipulation experiments dedicated to 
studying extreme events, targeted long-term carbon-cycle observations, 
and an emphasis on high-resolution climate and biosphere modelling. 


Climate extremes and the biosphere 

The study of climate and weather extremes has a long history in climato- 
logy and hydrology. This research has led to a commonly applied statis- 
tical framework for defining climate extremes'"’’. Yet such definitions of 
extremes, based on climate statistics alone, are not necessarily well suited 
for assessing the impact on ecosystems and their carbon cycle, as illustrated
in the following thought experiment. Consider a year with precipitation
of, say, 499 mm. This observation is regarded as a statistical extreme
if annual precipitation over the past 100 years has always been between
500 mm and 510 mm (although with considerable seasonal variation),
even though the deficit of 1-11 mm, or 0.2-2%, is negligible in
terms of its ecosystem impact. Similarly in the real world, if monthly winter
temperatures at high latitudes typically vary between −30 °C and −40 °C,
a month with −25 °C is again extreme according to the climatological
definition, but is far below any critical threshold for inducing an ecosystem
response.
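
The thought experiment can be reproduced numerically. The following is a minimal sketch, not a method taken from the studies cited here: the synthetic 100-year record and the 20% impact threshold are hypothetical values chosen only to contrast a purely statistical definition of an extreme with an impact-oriented one.

```python
from statistics import mean


def is_statistical_extreme(value, history):
    """Flag a value that lies outside the range of the historical record."""
    return value < min(history) or value > max(history)


def relative_deviation(value, history):
    """Deviation from the historical mean, as a fraction of that mean."""
    baseline = mean(history)
    return (value - baseline) / baseline


# Hypothetical 100-year record: annual precipitation between 500 and 510 mm.
history_mm = [500 + (year % 11) for year in range(100)]
dry_year_mm = 499

flagged = is_statistical_extreme(dry_year_mm, history_mm)
deviation = relative_deviation(dry_year_mm, history_mm)

# Assumed for illustration only: an ecosystem response requires a 20% deficit.
IMPACT_THRESHOLD = 0.20
ecosystem_relevant = abs(deviation) >= IMPACT_THRESHOLD

print(f"statistical extreme: {flagged}")
print(f"deviation from mean: {deviation * 100:.2f}%")
print(f"relevant for the ecosystem: {ecosystem_relevant}")
```

The year is flagged as a statistical extreme although its deviation from the mean is far too small to matter under the assumed impact threshold.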

A specific definition of climate extremes relevant for terrestrial eco- 
systems is thus needed, where the extremeness in the expected response, 
not only in meteorological drivers, is considered. Accordingly, it has been 
suggested that an extreme climatic event should be defined as “an episode 
or occurrence in which a statistically rare or unusual climatic period 
alters ecosystem structure and/or functions well outside the bounds of 
what is considered typical or normal variability”’’. This definition is general
enough to include meteorological constellations that are not extreme
for a single variable but are extreme for a combination of variables (a 
multivariate extreme or compound event'*"*), such as a combined heat- 
wave and drought, or a drought followed by extreme precipitation. 

To emphasize the impact perspective further, we restate this definition 
of biosphere-relevant climate extremes as “conditions where an ecosystem 
function (such as carbon uptake) is higher or lower than a defined extreme 
percentile during a defined time period and over a certain area, traceable 
to single or multivariate anomalous meteorological variables”. As a conse- 
quence of this definition, the identification and detection of extremes is 
first focused on ecosystem diagnosis, and then requires the attribution of 
an extreme ecosystem impact to immediate and lagged effects of meteoro- 
logical variables (also see “Carbon-cycle extremes in the satellite period” 
section). 
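
As an illustration of this impact-oriented definition, the sketch below flags time steps at which an ecosystem function falls outside chosen percentiles of its own distribution. The anomaly values, the function names and the 5th/95th percentile choice are assumptions made for the example; attribution of the flagged time steps to meteorological drivers would be a separate, later step.

```python
def percentile(values, q):
    """Percentile (q in [0, 100]) with linear interpolation between ranks."""
    ordered = sorted(values)
    if len(ordered) == 1:
        return ordered[0]
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def find_extremes(anomalies, low_q=5.0, high_q=95.0):
    """Return (index, kind) for values below low_q or above high_q percentile."""
    low = percentile(anomalies, low_q)
    high = percentile(anomalies, high_q)
    return [
        (index, "low" if value < low else "high")
        for index, value in enumerate(anomalies)
        if value < low or value > high
    ]


# Hypothetical monthly anomalies of carbon uptake (arbitrary units).
uptake_anomalies = [0.1, -0.2, 0.0, 0.3, -2.5, 0.2, -0.1, 0.1, 1.9, 0.0, -0.3, 0.2]
extremes = find_extremes(uptake_anomalies)
print(extremes)
```

Detection here starts from the ecosystem variable itself, which mirrors the order proposed in the text: diagnose the ecosystem first, then look for the meteorological cause.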

There is clear evidence that extreme events not only affect the carbon 
cycle concurrently (for example, by reducing vegetation productivity or 
destroying carbon stocks during fire events), but can initiate lagged res- 
ponses. For example, one year after an anomalously warm season, soil 
heterotrophic respiration was enhanced in a grassland, offsetting net 
ecosystem carbon uptake"; soil frost increased the sensitivity of hetero- 
trophic respiration to summer drought in a forest'®; and increased tree 
mortality was measured after severe droughts in many instances’”-”’. 
Lagged and legacy effects of extreme events on ecosystem carbon cycling 
are poorly understood, and can potentially involve multiple synergistic 
and antagonistic mechanisms operating in parallel at different levels of 
organization and timescales, for which a hierarchical response framework 
has been suggested”®. These mechanisms include (1) diminished plant
resistance to abiotic stress (for example, via antioxidants, osmolytes or
changes in membrane stability), pests and pathogens (for example, via
altered secondary metabolites) and their effects on plant performance; (2)
changes in the amount, quality and timing of litter and rhizodeposition;
(3) effects on soil physical and chemical characteristics (soil organic matter
fractions, aggregate stability, hydrophobicity*'); and (4) shifts in plant,
microbial and animal species composition (for example, increased fungi,
because fungi are more drought-resistant than bacteria’’) and associated
changes in carbon and nitrogen cycling, which feed back to (1)-(3)”.

Future research should address such mechanisms in integrated studies
at the ecosystem scale, and provide a mechanistic basis for projecting
effects of extreme meteorological constellations on the carbon cycle. In
addition to the above-mentioned mechanisms at ecosystem level, we
also envisage lagged effects in societal and economic systems with substantial
lagged responses in the carbon cycle, for example if increasing
food prices caused by low yields, combined with risk of wind throw, were
to encourage conversion of forests to croplands or grasslands.


Ecosystem-dependent processes and impacts


Climate extremes induce a suite of interconnected effects, all of which have
the potential to alter the carbon balance of ecosystems profoundly on
different timescales (Fig. 1). This is illustrated by the effect of heatwaves
and dry spells: this type of climate extreme has a direct effect on CO₂
fluxes, because both photosynthesis and respiration respond to warmer
temperature and soil moisture limitation. Moreover, these factors work
synergistically at the leaf, ecosystem and the regional scales as follows.
Drought leads to stomatal closure by plants, decreasing leaf transpiration
and evaporative cooling, aggravating the effect of high air temperatures
(and strong short-wave radiation)”’. Similarly, at the regional scale, soil
moisture-temperature feedbacks can lead to a higher likelihood of heatwaves
under dry soil conditions”.

Figure 1 | Processes and feedbacks triggered by extreme climate events.
The extreme events considered are droughts and heatwaves, heavy storms,
heavy precipitation and extreme frost. Solid arrows show direct impacts;
dashed arrows show indirect impacts. The relative importance of the impact
relationship is shown by arrow width (broader arrows are more important).

Moreover, soil drought, meaning more negative soil water potential 
and low soil hydraulic conductivity, usually leads to a higher vapour- 
pressure gradient between leaves and the atmosphere, causing a stress on 
the hydraulic system of plants that is further exacerbated by high tem- 
peratures. Consequently, high tension in the xylem can trigger embolism 
and partial failure of hydraulic transport in the stem, and can even be a 
contributory factor to mortality. This mechanism is currently regarded as 
a dominant cause of tree mortality under drought'®’®, interdependent 
with other postulated mechanisms related to the carbohydrate metabolism
and insect infestations. In this respect, plant mortality can be a lagged
effect of heat and drought that has impacts on the carbon balance for 
decades at least, and can lead to changing vegetation cover with associa- 
ted feedbacks to local and regional hydrology and climate. Further effects of 
drought stress include an increase of fire risk and—as a delayed response— 
pathogen and pest outbreaks, the latter also being related to the sensiti- 
vity of ecosystems to heavy storms and wind-throw-related mortality in 
forests (Fig. 1). 
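
How strongly temperature amplifies the leaf-to-air vapour-pressure gradient can be sketched with a simple calculation. The example below is an illustration under stated assumptions, not part of the analyses cited here: it uses the widely used Tetens approximation for saturation vapour pressure, assumes the air inside the leaf is saturated, and uses hypothetical temperatures and humidities.

```python
import math


def saturation_vapour_pressure_kpa(temp_c):
    """Tetens approximation of saturation vapour pressure over water (kPa)."""
    return 0.6108 * math.exp(17.27 * temp_c / (temp_c + 237.3))


def leaf_to_air_vpd_kpa(leaf_temp_c, air_temp_c, relative_humidity):
    """Vapour-pressure difference between a saturated leaf interior and the air.

    relative_humidity is a fraction between 0 and 1.
    """
    e_leaf = saturation_vapour_pressure_kpa(leaf_temp_c)
    e_air = relative_humidity * saturation_vapour_pressure_kpa(air_temp_c)
    return e_leaf - e_air


# Hypothetical conditions: a mild day, and a hot, dry day on which closed
# stomata reduce evaporative cooling so that the leaf is warmer than the air.
mild_day = leaf_to_air_vpd_kpa(leaf_temp_c=25.0, air_temp_c=25.0, relative_humidity=0.60)
hot_dry_day = leaf_to_air_vpd_kpa(leaf_temp_c=38.0, air_temp_c=35.0, relative_humidity=0.30)

print(f"mild day:    {mild_day:.2f} kPa")
print(f"hot dry day: {hot_dry_day:.2f} kPa")
```

Because saturation vapour pressure rises roughly exponentially with temperature, a warmer leaf in drier air faces a much larger gradient, which is consistent with the hydraulic stress described above.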

Although the mechanisms triggered by different climate extremes can be 
described in conceptual terms as in the section above, their specific impact 
is highly dependent on ecosystem type. We summarize in Table 1, and 
the following section, specific anticipated effects of climate extremes for 
forests, peatlands, grasslands and croplands. 
Forests and peatlands


Forests are characterized by large biomass carbon stocks per square
metre, which are vulnerable to wind throw, (ice-)storms, frost, drought, 
fire and pathogen or pest outbreaks. Moreover, given that trees take a long 
time to regrow, recovery times for forest biomass lost through extreme 
events are particularly long. Hence, the effects of climate extremes on the 
carbon balance in forests are both immediate and lagged, and potentially 
long-lasting. The impact of a climate extreme depends partially on events
that have happened years before. Thus, assessing the net effect of climate 
extremes on forest carbon stocks requires a clearly defined time horizon 
during which immediate and lagged losses or gains can be assessed. In 
addition, any climate mitigation policy that relies on long-term carbon 
storage in forest biomass and the forest soil must guard against the 
likelihood of loss of accumulated carbon stocks in the face of future 
extreme events. 

Although forests are potentially susceptible to all types of extreme event 
(Table 1), globally, drought is the most widespread factor affecting the 
carbon balance. For instance, during the European 2003 heatwave, pre- 
cipitation (and soil moisture) deficit rather than temperature was the 
main factor reducing the water and carbon fluxes in the temperate and 
Mediterranean forest ecosystems”””*. Severe and recurrent droughts have 
been identified as a major contributing factor in the recently accelera- 
ted rates of tree decline and mortality of forests'?”°*". Beyond the most 
extensively studied mid-latitude forest belts, drought is also a relevant 
driver of the physiology and carbon cycling of the highly productive tropical
forests**. Amazonian forests were estimated to have lost 1.6 petagrams
(10¹⁵ g) of carbon (Pg C) and 2.2 Pg C following the severe droughts of
2005 and 2010, respectively (refs 30 and 32). Soil conditions and rooting
patterns play an important part in resilience and resistance”.


Table 1 | How forests, grasslands and croplands are affected by climate extremes

| Land-cover type | Extremes | Key impact mechanisms | Examples of documented highly susceptible regions | Scientific understanding of future occurrence!® | Scientific understanding of carbon-cycle impact |
|---|---|---|---|---|---|
| Forest | Storms | Wind throw transforms carbon stock from living biomass to dry, dead wood; wind throw increases risk of fires and pathogen outbreaks | The Amazon®®, North America?®?’, central Europe®® | Low | Medium |
| Forest | Drought; heat | Water availability affects plant physiology, phenology and carbon allocation patterns; increased tree mortality, fire risk and susceptibility to pathogens; shifts in vegetation composition (impacts are large and delayed owing to the longevity of trees) | Central Europe?”?®, western North America®, the Amazon?°32 | Drought: low to medium; heat: medium to high | Drought: low; heat: low |
| Forest | Fire | Tree mortality has a large, fast impact on large carbon stocks in forests | Western North America’®, southeastern Asia’, the Mediterranean’’, the circum-boreal areas°°, the Amazon’® | Low | Low |
| Forest | Ice storm and frost | Physical damage can include destruction of whole forest; xylem embolism and desiccation’? | China®°, North America®! ®2 | Medium to high (for cold temperatures) | Low |
| Grasslands | Drought; heat | Species composition shifts (especially combined with additional pressure such as overgrazing); degradation and desertification (especially combined with overgrazing); erosion (combined with heavy precipitation or storms) | North America®? **, Europe®®, central Asia®® | Drought: low to medium; heat: medium to high | Drought: medium; heat: low |
| Croplands | Storms | Wind erosion and soil displacement with unclear consequences for the carbon cycle; direct crop damage | China®’, North America®® | Low | Low |
| Croplands | Heavy precipitation (including hail) | Erosion causing loss and displacement of soil and hence carbon; erosion affecting the soil’s long-term productive capacity; crop damage or failure caused by hail and waterlogging of soils and subsequent anaerobic conditions; crop lodging, that is, the permanent displacement of cereal stems from the vertical; increase of pests and pathogens | The tropics®®, North America®, Australia??, the Mediterranean??, western Europe™, east Asia? | Medium to high (low for hail) | Low |
| Croplands | Drought and heat | Reduced growth or complete crop failure | Europe>3*23, North America®, China?° | Low to medium | Medium |
| Croplands | Extreme cold | Reduced growth; complete winter-crop failure, especially during spring frosts (combined with drought stress) | North America?®, south Australia?’, Europe’? | Medium to high | Low |

Heavy storms and tropical cyclones regularly lead to severe forest damage, 
resulting in the loss of major limbs and foliage and widespread mortal- 
ity®. Storms are considered to be the most important natural disturbance 
affecting European forests**—the Lothar superstorm reduced the European 
standing forest biomass stocks in 1999 by about 16 teragrams (10¹² g) of
carbon (Tg C), corresponding to approximately 30% of the net biome 
production in Europe*’. Similarly, (sub-)tropical forests are vulnerable to 
wind-storm mortality and tropical-cyclone-driven mortality**””. A single 
squall line propagating across Amazonia in January 2005 caused wide- 
spread forest tree mortality, and threw to the ground the equivalent of 
23% of the basin-wide mean annual biomass accumulation”**. 

Around half of the average annual flux of 2.0 Pg C yr⁻¹ globally emitted
by fires between 1997 and 2009 was from forest ecosystems, with
20% from deforestation and degradation fires in tropical forest, 16%
from woodland fires, and 15% from (mostly extra-tropical) forest fires
during the MODIS satellite era (2001-2009)*”. Fires and pest outbreaks
can be facilitated by climate extremes in subtropical, Mediterranean and
boreal forests and woodlands, but a direct link is not always easy to prove,
because many factors, including direct human action, can trigger pests
and fires. Nevertheless, climate anomalies associated with El Niño episodes
have been shown to cause extreme fire events in tropical forests,
affecting Amazon rainforest” and tropical southeast Asia, the latter contributing
66% of the atmospheric CO₂ growth rate anomaly during the
1997-1998 event".
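
The statement that around half of fire emissions come from forest ecosystems follows from the three shares quoted above. A short arithmetic check, using only those figures (the variable names are ours):

```python
# Average annual global fire emissions quoted in the text (Pg C per year).
TOTAL_FIRE_FLUX_PG_C_PER_YR = 2.0

# Shares of that flux attributed to forest-related fire types in the text.
FOREST_RELATED_SHARES = {
    "tropical deforestation and degradation fires": 0.20,
    "woodland fires": 0.16,
    "forest fires (mostly extra-tropical)": 0.15,
}

forest_share = sum(FOREST_RELATED_SHARES.values())
forest_flux = TOTAL_FIRE_FLUX_PG_C_PER_YR * forest_share

print(f"forest-related share: {forest_share:.0%}")
print(f"forest-related flux:  {forest_flux:.2f} Pg C per year")
```

The three shares sum to 51%, or roughly 1 Pg C per year.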

Peatlands have large carbon stocks, as forests do, but they are below 
ground and are mostly preserved by their high water table, which limits 
decomposition. Tropical peatlands contain approximately 90 Pg C world- 
wide and are particularly susceptible to drought and fires”. Northern peat- 
lands contain 500 Pg C (ref. 43) and are susceptible to hydrological extremes, 
such as droughts and heavy rainfall, which in combination can lead to
losses of carbon both by decomposition and immediate CO₂ release and by export
of dissolved organic carbon, by which a quarter of net ecosystem production
can be lost™.

In summary, forest (and woodland) ecosystems are potentially suscep- 
tible to all climate extremes, with a plethora of important processes and 
indirect effects, as indicated in Fig. 1. With both large carbon stocks (stand- 
ing biomass) and carbon fluxes being strongly affected by extremes, forest 
is the most sensitive biome to climate extremes. Yet many processes (for 
example, mortality) do not lead to immediate release of carbon to the 
atmosphere, but rather to a committed release via decomposition (Box 1, 
Fig. 2). If extremes are followed by fast regrowth and in each recovery cycle
carbon with increased residence times is generated (for example, by
subsurface transport or charcoal formation), the whole process can be 
carbon neutral, or potentially can even lead to increased long-term sinks, 
though this has not yet been demonstrated. 
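
The idea of a committed release can be made concrete with a minimal sketch that assumes simple first-order decay of a dead-wood pool. The decay rate of 0.05 per year is a hypothetical value, and the 16 Tg C pool size is borrowed from the storm example above purely for scale; neither is a fitted parameter from the studies cited here.

```python
import math


def cumulative_release(dead_wood_c, decay_rate_per_yr, years):
    """Carbon released after 0..years years, assuming first-order decay."""
    return [
        dead_wood_c * (1.0 - math.exp(-decay_rate_per_yr * year))
        for year in range(years + 1)
    ]


def years_to_release(fraction, decay_rate_per_yr):
    """Time needed to release a given fraction (0 < fraction < 1) of the pool."""
    return -math.log(1.0 - fraction) / decay_rate_per_yr


POOL_TG_C = 16.0        # illustrative pool size (Tg C)
DECAY_RATE = 0.05       # hypothetical first-order decay constant (per year)

released = cumulative_release(POOL_TG_C, DECAY_RATE, years=30)
half_time = years_to_release(0.5, DECAY_RATE)

print(f"released after 1 year:   {released[1]:.2f} Tg C")
print(f"released after 30 years: {released[30]:.2f} Tg C")
print(f"time to release half:    {half_time:.1f} years")
```

Under these assumptions only a small part of the carbon reaches the atmosphere in the year of the event, and the remainder is emitted over decades, which is why stock changes and atmospheric signals need not coincide in time.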


Grasslands and savannas 


Grasslands are susceptible to drought, whereas in contrast to forests, other 
extremes (for example, storms) play a smaller, if not negligible, part. Fire 
is often an intrinsic factor in grasslands and savannas, and cannot be considered
an extreme. It does, however, contribute to the suppression of
trees in woody savannas (such as Miombo) and hence the suppression of
build-up of high above-ground carbon stocks. Marcolla et al.*” showed
strong acclimation of a grassland to the interannual variability of climate,
leading to a dampening of the interannual variability of the carbon 
balance in an Alpine grassland. Grasslands are also characterized by 
the high recovery potential of plant growth, as observed both in managed 
and in unmanaged grasslands**’’, although the timing of the drought 
event may be a crucial factor in this regard**. Overall, this high resilience 
distinguishes grasslands clearly from forests and explains why, on a global 
scale, grasslands often prevail in climates where extremely dry years can 
occur. However, once degradation feedbacks come into play, where 
drought triggers loss of vegetation and heavy rain causes subsequent 
erosion, more frequent extreme events may contribute to desertification 
of semi-arid to arid grassland, in particular when (over-)grazing is an 
additional pressure. 


Croplands 


There are at least three characteristics in which croplands differ from other 
ecosystems with respect to carbon-cycle responses to climate extremes. 
First, cropland systems are entirely managed, and management inter- 
ventions can react on short timescales (for example, irrigation). Second, 
under annual cropping, the soil-vegetation system is reset regularly through 
harvest and agricultural management such as tillage, manure/residue 
management and irrigation. Third, type and duration of crop cover are 
highly variable, and the soil can be bare for an extended period of time. 
Consequently, the response to extremes is highly modulated by human 
intervention both immediately and over longer periods (for example, by 
changing cultivars or cultivation practice)”. 

Thus, the nature and extent of human interventions present one of the 
greatest uncertainties in assessing the impact of extremes on the carbon 
balance of croplands™. Side effects will also very probably result from active 
interventions by farmers. Increased irrigation may enhance root biomass 
production, microbial activity and erosion rates, leading to increased or 
decreased soil organic carbon stocks. Regular harvesting and soil treatment
make long-term biological legacy effects of climate extremes less
likely than in forests or grasslands, but a legacy effect due to the
impacts of pathogen population dynamics cannot be excluded in croplands.
In addition, during a critical phase of its development, a crop species
can be vulnerable to a very specific stressor, which is otherwise unimportant
during the rest of the year. For instance, rice pollen can become
sterile if air temperature passes a 37 °C threshold during the short pollination
phase in spring*'. In general, the impact of an extreme event is a
crop-specific function of the timing of the extreme, in relation to the
sensitivity of the plant during its growth stage”. Note also that crop yield
can be decoupled from the carbon balance in the face of extreme climate
conditions. For instance, a very high carbon uptake caused by an exceptionally
warm winter did not induce high yield at a wheat site in Belgium,
because of unfavourable weather conditions during the grain maturation
stage”.


BOX 1

Carbon processes triggered by climate extremes

Some extremes produce a direct biogeochemical signal in the atmosphere
(Fig. 2, orange arrows), and may first be detected as regional anomalies in
atmospheric CO₂ concentrations, analogous to a pollution plume. Globally,
such signals are diluted by atmospheric mixing, but if large enough may
still be detected as part of an annual growth anomaly.

In other cases, the effect of a climate extreme is essentially to alter the
turnover rate of terrestrial carbon pools, leading to prolonged release of
CO₂ to the atmosphere (see Fig. 1 for mechanisms) as a legacy from the
extreme. For example, after a peak in mortality induced by wind throw or
extreme drought, dead wood will take decades to decay. Such losses, often
referred to as ‘committed CO₂ emissions’, may contribute to discrepancies
between detected biosphere carbon-cycle anomalies and atmospheric CO₂
signals.

Although they may be less obvious, climate extremes may also trigger
processes that decrease the turnover rate of some carbon pools and lead to
additional long-term sequestration in these pools. A well-known example is
the creation of charcoal during fire, which generally persists longer in soils
than does the usual litter input®.

Transport processes of particulate or dissolved organic carbon outside the
ecosystem, either towards the subsurface or laterally into lake and river
systems, have recently been shown to move relevant amounts of carbon®?.
These processes can lead to both stabilization and destabilization of
carbon, that is, they may decrease or increase turnover times. Indirect
effects on biogeochemical cycling in lakes (such as stimulation of turnover)
add to the complexity induced by the landscape-scale lateral processes, and
the connection between ecosystems. The net effect of these transport
processes on the carbon balance remains unclear.


Figure 2 | Overview of how carbon flows may be triggered, or greatly
altered, by extreme events. Emphasis is on the potential contrast between the
concurrent and delayed signal in the atmosphere. Concurrent effects mean that
the carbon signal can be found in the atmosphere while the climate extreme is
occurring. Delayed and long-term signals occur either well after the extreme
event has occurred, or are too small compared to the background short-term
variability to be immediately detectable. The concurrency and signal strengths
involved with lateral transport probably depend on the transport distance,
which is indicated by the colour gradient (from orange to purple). All
concurrent fluxes can also be delayed, given the mechanisms and causal chains
depicted in Fig. 1 and discussed in the main text. Sources and sinks are meant
relative to conditions without extremes.


Carbon-cycle extremes in the satellite period

Satellites enable us to evaluate the state of land vegetation. For instance,
the fraction of absorbed photosynthetically active radiation (fAPAR) often
serves as a spatiotemporal indicator for vegetation activity. In tandem with
global networks of station measurements of land-atmosphere exchange
fluxes of CO₂ (refs 54 and 55) (that is, the FLUXNET initiative), modern
machine-learning methods allow us to translate fAPAR into robust estimators
for gross primary productivity (GPP). Today these continuous global
data streams form a natural basis on which to examine the past three
decades and to quantify the impacts of climate extremes on the carbon
cycle across the world’s ecosystems.

Zscheischler et al. (ref. 56) followed this impact-oriented search strategy and
inventoried three decades of extremes in fAPAR anomalies with an
emphasis on large spatiotemporally contiguous events. Using a 10%
threshold to define fAPAR extremes, they estimated that the associated
decrease in global GPP amounts on average to 2.7 Pg C yr⁻¹ during the
period 1982-2011 (using the fifth and first percentiles, the GPP decreases
are 1.9 Pg C yr⁻¹ and 0.7 Pg C yr⁻¹, respectively). Regionally, the most
pronounced reduction of GPP (64 Tg C yr⁻¹) was observed in Northern
Asia (following the IPCC nomenclature for regions’”). Further, the regions
most vulnerable to extremes are eastern Africa (55 Tg C yr⁻¹), eastern
Asia (53 Tg C yr⁻¹), northeastern Brazil (53 Tg C yr⁻¹) and central North
America (47 Tg C yr⁻¹). An analogous analysis of the CMIP5 model runs!
over the same period results in a global decrease of GPP due to extremes
of 9.60 Pg C yr⁻¹ at the tenth percentile (5%: 6.56 Pg C yr⁻¹; 1%:
2.28 Pg C yr⁻¹) averaged over all model runs. This higher sensitivity of
GPP from models compared to Earth observation data could arise from
model bias and (in tropical regions) from the difficulty of obtaining
good fAPAR data from satellite observations and a consequent underestimation
of GPP interannual variability in the satellite-FLUXNET
data produced”’.


15 AUGUST 2013 | VOL 500 | NATURE | 291

©2013 Macmillan Publishers Limited. All rights reserved

| RESEARCH | PERSPECTIVE
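
The gap between the model-based and observation-based estimates of GPP reductions quoted in the preceding paragraph can be expressed as a ratio for each percentile threshold. The sketch below uses only the numbers given there; the function and variable names are ours.

```python
# GPP reductions attributed to extremes, in Pg C per year, keyed by the
# percentile threshold used to define an extreme (values quoted in the text).
OBSERVED = {10: 2.7, 5: 1.9, 1: 0.7}       # satellite-FLUXNET estimate
MODELLED = {10: 9.60, 5: 6.56, 1: 2.28}    # average over CMIP5 model runs


def model_to_observation_ratios(modelled, observed):
    """Ratio of modelled to observed GPP reduction for each threshold."""
    return {
        threshold: modelled[threshold] / observed[threshold]
        for threshold in sorted(observed, reverse=True)
    }


ratios = model_to_observation_ratios(MODELLED, OBSERVED)
for threshold, ratio in ratios.items():
    print(f"threshold {threshold}%: models / observations = {ratio:.2f}")
```

At all three thresholds the modelled reduction is between three and four times the observation-based one, so the discrepancy is not specific to one definition of an extreme.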

Attributing the observed fAPAR extremes over the past 30 years to ano- 
malies in either temperature and/or water availability, or to fire events yields 
the strongest association with drought*®. Globally, from the hundred lar- 
gest negative fAPAR extremes identified from satellite measurements 
between 1982 and 2011, 56 events are explainable by water scarcity, 14 by 
extreme high temperatures and 10 by exceptionally large fires (note, 
however, that of the hundred largest events only 43 fall in the time frame 
where state-of-the-art fire data are available, that is, 1997-2010). Thirty- 
five negative fAPAR extreme events could not be attributed to a particular 
driver. A regional exploration reveals systematic associations of negative 
fAPAR extremes with high water deficits in Africa, India and south- 
western Russia (Fig. 3a). Associations with both high temperatures and 
water deficit are found over Australia, North America and South 
America. Some of the remaining fAPAR extreme events, not attributable
to temperature or drought, might be caused by large-scale wind throw, by
biotic events such as pest outbreaks or by a complex response to multiva- 
riate extremes (extremes not manifested in individual variables but in their 
uncommon coincidence). The total impact of extreme, spatiotemporally 
contiguous fAPAR anomalies on GPP is most strongly influenced by the 
spatial extent at which ecosystems have been affected by regionally extreme 
climate conditions (Fig. 3b). In contrast, the maximum intensity and the 
duration of each regional GPP anomaly are of secondary relevance. 
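
To illustrate what a spatiotemporally contiguous event is and how its spatial extent and duration can be separated, the sketch below labels connected extreme cells in a small (time, row, column) grid. It is a toy example under stated assumptions: the mask is hypothetical, cells are treated as connected only through shared faces (6-connectivity), and this is not presented as the procedure used in ref. 56.

```python
from collections import deque

# Neighbours one step away in time, row or column (6-connectivity, assumed).
STEPS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def label_events(mask):
    """Group True cells of mask[t][row][col] into contiguous events."""
    n_t, n_rows, n_cols = len(mask), len(mask[0]), len(mask[0][0])
    seen = set()
    events = []
    for t in range(n_t):
        for row in range(n_rows):
            for col in range(n_cols):
                if not mask[t][row][col] or (t, row, col) in seen:
                    continue
                # Breadth-first search collects every cell of this event.
                queue = deque([(t, row, col)])
                seen.add((t, row, col))
                cells = []
                while queue:
                    ct, cr, cc = queue.popleft()
                    cells.append((ct, cr, cc))
                    for dt, dr, dc in STEPS:
                        nt, nr, nc = ct + dt, cr + dr, cc + dc
                        inside = 0 <= nt < n_t and 0 <= nr < n_rows and 0 <= nc < n_cols
                        if inside and mask[nt][nr][nc] and (nt, nr, nc) not in seen:
                            seen.add((nt, nr, nc))
                            queue.append((nt, nr, nc))
                events.append(cells)
    return events


def summarize(cells):
    """Total size, duration and maximum spatial extent of one event."""
    times = [cell[0] for cell in cells]
    max_area = max(sum(1 for cell in cells if cell[0] == t) for t in set(times))
    return {"size": len(cells), "duration": max(times) - min(times) + 1, "max_area": max_area}


# Hypothetical mask of extreme anomalies: 3 time steps on a 3 x 4 grid.
T, F = True, False
extreme_mask = [
    [[T, T, F, F], [F, F, F, F], [F, F, F, T]],
    [[T, T, F, F], [T, F, F, F], [F, F, F, F]],
    [[F, F, F, F], [F, F, F, F], [F, F, F, F]],
]

summaries = [summarize(cells) for cells in label_events(extreme_mask)]
print(summaries)
```

Weighting each cell by its GPP anomaly instead of counting it would give an integrated impact per event, which could then be compared with the area, duration and maximum anomaly of that event.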

It has been proposed that the size distribution of disturbance events in 
terrestrial ecosystems scales with a power law, like multiple other pro- 
cesses that are governed by phenomena of self-organized criticality”. 
Scrutinizing the distributions of spatiotemporally integrated GPP extremes 
supports this contention. Zscheischler et al. (ref. 56) found that the magnitude
of GPP extremes (fifth percentile) follows a power law with an exponent
α lying in the remarkably narrow range of 1.74 ± 0.02 across all
vegetated continents (Fig. 3c). These values of α are consistent with
previous estimates for the spatial extent of disturbance events in tropical
and subtropical forests**. On the one hand, the convergence of these
macro properties points towards a coherent description and scaling of
extremes in the global carbon cycle. On the other hand, we can also
differentiate between different continents: Africa and South America are
most likely to be hit by an extreme GPP impact of a given magnitude
(Fig. 3c).

Figure 3 | Global impact of extreme events on the carbon cycle. a, Global
distribution of extreme events impacting the terrestrial carbon cycle, defined as
contiguous regions of extreme anomalies of fAPAR (lower first percentile)
during the period 1982-2011 following the nomenclature of ref. 56. The
hundred largest events on each continent are shown, along with whether they
can be associated with water scarcity (blue, W), extreme high temperatures
(red, T), both (pink) or neither (grey). The colour reflects the intensity of the
extreme event in terms of integrated loss of GPP, as indicated in the inset.
b, Correlation of spatiotemporally integrated event impact size with maximum
spatial extent (‘Area’), duration (‘Duration’) and maximum intensity
(‘Maximum anomaly’). c, Size distribution of spatiotemporally integrated event
impacts, following a power law with similar scaling exponents across all
continents. α is the scaling exponent.
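
How a scaling exponent of this kind can be estimated from a set of event sizes is sketched below. The example assumes a continuous power law above a known minimum size and uses the standard maximum-likelihood estimator; the event sizes are synthetic, drawn with a fixed random seed, and the fitting procedure actually used in ref. 56 may differ.

```python
import math
import random


def sample_power_law(alpha, x_min, n, rng):
    """Draw n values from p(x) proportional to x**(-alpha) for x >= x_min."""
    # Inverse-transform sampling; 1 - u lies in (0, 1], so the power is finite.
    return [x_min * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0)) for _ in range(n)]


def estimate_exponent(sizes, x_min):
    """Maximum-likelihood estimate of alpha and its approximate standard error."""
    tail = [size for size in sizes if size >= x_min]
    log_sum = sum(math.log(size / x_min) for size in tail)
    alpha_hat = 1.0 + len(tail) / log_sum
    standard_error = (alpha_hat - 1.0) / math.sqrt(len(tail))
    return alpha_hat, standard_error


rng = random.Random(0)                      # fixed seed for reproducibility
TRUE_ALPHA = 1.74                           # exponent reported in the text
synthetic_sizes = sample_power_law(TRUE_ALPHA, x_min=1.0, n=20000, rng=rng)

alpha_hat, standard_error = estimate_exponent(synthetic_sizes, x_min=1.0)
print(f"estimated exponent: {alpha_hat:.3f} +/- {standard_error:.3f}")
```

With an exponent below 2 the mean of such a distribution is dominated by the largest events, which fits the observation that a few very large events account for much of the total impact.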


Spatiotemporal context and quantification 


Detecting spatially and temporally contiguous extreme impacts in the 
carbon cycle and attributing them to climate extremes is naturally limited
by data availability. In particular, the quantification and attribution of
effects of extreme events on the carbon balance depend strongly on whether
the considered time frame also captures lagged responses and other
carry-over effects, as well as ecosystem recovery processes (see also Box 1
and Figs 1 and 2). The spatial scale of integration also affects the
quantification of impacts of climate extremes on the carbon cycle, for example,
when the climate extreme causes a transfer of carbon from one ecosystem
to another. Inland waters have only recently been considered as important
sources or sinks of carbon following extreme events.

Carbon stored in lakes and rivers originates from soils and wetlands, 
with a smaller contribution from autochthonous net ecosystem produc- 
tion. In particular, heavy precipitation events may cause substantial late- 
ral transfers of particulate and dissolved organic carbon from terrestrial 
to aquatic ecosystems, where it may be partly respired and partly buried’, 
leading to increased carbon sequestration. Aquatic systems are estimated
to be the second most important ecosystem in terms of carbon sequestration,
ranking above croplands and grassland in Europe”. Further, erosion
in general transports carbon from one system to a neighbouring one, where
the imported carbon may be accumulated and turned over, and may potentially
also mobilize autochthonous carbon pools”.


Coming to grips with carbon-cycle extremes 


As shown above, climate extremes trigger anomalous pulses in the car- 
bon cycle, which can temporarily offset carbon sinks (see references in 
Table 1) and create CO₂ concentration signals detectable by large-scale
atmospheric observations if the spatial extent of the affected region is
large enough*'. The analysis by Zscheischler et al. (ref. 56 and the above
section) indicates that anomalies in vegetation CO₂ uptake induced by
climate extremes add up to an average signal of global relevance, that is,
of similar magnitude to that of the terrestrial carbon sink’. Yet, on a global
scale, the global growth rate of atmospheric CO₂ indicates that the land
and oceans have continued to take up CO₂ with nearly the same relative
strength as in the past”. This suggests that negative impacts of climate
extremes on the global terrestrial carbon sink have so far not been increa-
sing or decreasing disproportionately, so that on a larger scale the net biome
production® (that is, the large-scale carbon balance including uptake, res-
piration, fire emissions, leaching, lateral transport and harvests) remains
in the long term a constant fraction of anthropogenic CO₂ emissions.
The terrestrial biosphere is the largest contributor to the year-to-year 
variability of the atmospheric CO₂ growth rate, but its relation to climate
variability is not fully understood, leaving interannual residuals ranging
between −2 Pg C yr⁻¹ and +2 Pg C yr⁻¹ when trying to close the global
carbon balance’. The effects of climate extremes on the carbon cycle have
the potential to explain part of this residual variation. However, simple
relationships between CO₂ growth rates and climate extremes are not
necessarily expected, because of the above-mentioned lag and legacy
effects involving ‘committed’ CO₂ emissions, regionally distinct con-
stellations of climate variables causing carbon extremes, and differences
in the response of ecosystems modulated by management. 
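
For orientation, the residual referred to above can be written as a simple budget identity. The notation below is an assumption borrowed from common global carbon budget conventions and is not defined in this review: E_FF and E_LUC stand for fossil-fuel and land-use-change emissions, G_atm for the atmospheric CO₂ growth rate, and S_ocean and S_land for the estimated ocean and land sinks. The cited budget may partition these terms differently.

```latex
% Illustrative notation only, not taken from the review; all terms in Pg C yr^{-1}.
B_{\mathrm{res}} = E_{\mathrm{FF}} + E_{\mathrm{LUC}} - G_{\mathrm{atm}} - S_{\mathrm{ocean}} - S_{\mathrm{land}},
\qquad
-2 \lesssim B_{\mathrm{res}} \lesssim +2 \ \mathrm{Pg\,C\,yr^{-1}}
```

Read this way, the statement in the text is that year-to-year values of the residual span roughly ±2 Pg C yr⁻¹, and that carbon-cycle extremes may account for part of that spread.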

Overall, the most pressing question is to what extent the coupling between 
increasing climate extremes’” and induced CO₂ losses to the atmosphere
might offset or even outweigh ecosystem carbon uptake arising from
gradually increasing CO₂ concentration, prolonged extratropical grow-
ing season length and nitrogen deposition. Available observations and
state-of-the-art modelling efforts are insufficient to provide definitive 
answers. Observational records of climate impacts on ecosystems and 
the carbon cycle are often too short and not widespread enough to pro- 
vide sufficient context to interpret and mechanistically understand rare 
events. Continued simultaneous site-level observations of climate variables, 
together with carbon and water cycles, such as those being carried out in a
global observation network (FLUXNET™) and global satellite-based 
observation of atmosphere and biosphere states, remain pivotal in this 
context. However, these should be more strongly complemented with 
longer time series from archived impacts, for instance in tree rings or 
sediments”. 

Moreover, we need targeted assessments in regions where impacts of 
climate extremes have occurred, emphasizing the impact-oriented per- 
spective proposed in this study. Nearly real-time information on the 
biosphere from satellite remote sensing and observation networks should 
enable rapid-response scientific campaigns to study after-effects and post- 
disturbance trajectories resulting from climate extremes. In particular, 
improved and repeated carbon state observations (such as biomass) from 
space would provide unprecedented information. Better mobilization of 
satellite imaging capabilities (that is, pointing satellites to an area affected 
by extremes, fast access to satellite data as for tsunami alerts’) and online 
data processing and distribution of flux tower and atmospheric station 
data® will also offer the research community new tools with which to study 
carbon extremes and provide information needed for policy-making. 
Given the importance of drought, we strongly advocate stronger inte- 
gration of carbon-related and hydrological observations and modelling 
efforts. More extensive high-quality soil moisture networks are critically 
important to better sample droughts in the affected regions. 

Apart from long-term and ‘fast-response’ observations, it is also par- 
ticularly important that future event-based manipulation experiments 
specifically test for thresholds and tipping points””’ of ecologically rele- 
vant processes that can trigger long-lasting changes. In current manipu- 
lation experiments, these nonlinearities are rarely tested for or identified”. 
To considerably improve our mechanistic understanding of ecosystem 
responses to extreme events, it is crucial to perform increasingly standar- 
dized sets of measurements that permit calculation of common metrics 
to help us deal with the enormous variation in manipulation intensities, 
timing, return frequencies and duration in a coherent manner’’. Compari- 
son of heatwave experiments requires quantification of the manipulation 
as experienced by the biota in terms of temperature as well as physiolo- 
gical water availability. Furthermore, a comprehensive approach with 
relevant impact measurements made at different plant and ecosystem 
levels (for example, stress hormones, allocation patterns, changes in vege- 
tation community, as well as responses of soil processes) would substan- 
tially increase the value of future experiments. In addition, targeted and 
rapid response studies taking advantage of naturally occurring extremes 
(which can be further modified experimentally) should be considered. Fol- 
lowing our global analysis, where we find tropical regions strongly affec- 
ted by climate extremes (Fig. 3), we suggest that future emphasis should 
be placed on the tropical biomes. 

Despite remarkable progress over the past decade, climate models still 
do not realistically simulate most climate extremes, because their designs 
are optimized to represent the long-term transient changes in the Earth 
system”’. Correspondingly, there also remain large uncertainties in pro- 
jections of changes in biosphere-relevant climate extremes’* (Table 1). 
However, given that droughts and related hydrological processes seem to 
be the dominant regional trigger for carbon-cycle extremes, an impro- 
ved representation of drought dynamics in climate models will be crucial. 
More realistic representation of atmospheric dynamics and particularly 
convective processes and cloud formation” as well as soil hydrology” 
will be required. Although recent emphasis has been on building Earth 
system models that integrate climate and biogeochemical cycles, higher- 
resolution modelling efforts and improved physics should provide the 
basis for better projections of future climate extremes and their conse- 
quences for Earth’s carbon cycle. 

Knowledge gained from new observations and experiments oriented 
towards climate extremes will enable the most important internal ecosys- 
tem feedbacks to be included and appropriately parameterized in terrest- 
rial biosphere models. Given that nutrient cycles modulate the response 
of carbon fluxes to extreme events, global carbon-cycle modellers should 
aim for multi-element modelling (carbon, nitrogen, phosphorus). As in 
climate models, this will require the adoption of higher spatial process
resolution in carbon-cycle models. Biological adaptation and the role of 
biodiversity also remain particularly unclear, and thus are hardly consi- 
dered in state-of-the-art Earth system models. One would expect adapta- 
tion and functional biodiversity to dampen the effect of climate extremes 
on the carbon cycle, but this has not been sufficiently established to jus- 
tify their inclusion in currently operational global models. 

The same is true for land management effects and their relation with 
climate extremes. The lateral and vertical transport of carbon out of the 
ecosystem into the hydrosphere (groundwater, rivers, lakes) during pre- 
cipitation extremes calls for more strongly landscape-oriented land sur- 
face models, where parts of the landscape are intimately coupled (see Box 1 
and Fig. 2). We suggest that coupling between the carbon and water 
cycles requires urgent attention, including how the carbon state variables 
feed back to the water cycle (such as soil organic carbon effects on infil- 
tration and water-holding capacity), thus affecting susceptibility to meteo- 
rological drought. 
Down’s syndrome is a common disorder with enormous medical and social costs, caused by trisomy for chromosome 21.
We tested the concept that gene imbalance across an extra chromosome can be de facto corrected by manipulating a 
single gene, XIST (the X-inactivation gene). Using genome editing with zinc finger nucleases, we inserted a large, 
inducible XIST transgene into the DYRK1A locus on chromosome 21, in Down’s syndrome pluripotent stem cells. The
XIST non-coding RNA coats chromosome 21 and triggers stable heterochromatin modifications, chromosome-wide 
transcriptional silencing and DNA methylation to form a ‘chromosome 21 Barr body’. This provides a model to study 
human chromosome inactivation and creates a system to investigate genomic expression changes and cellular 
pathologies of trisomy 21, free from genetic and epigenetic noise. Notably, deficits in proliferation and neural rosette 
formation are rapidly reversed upon silencing one chromosome 21. Successful trisomy silencing in vitro also surmounts 
the major first step towards potential development of ‘chromosome therapy’.


In the United States, about 1 in 300 live births carry a trisomy, half of 
which are for chromosome 21, which causes Down’s syndrome. Down’s 
syndrome is the leading genetic cause of intellectual disabilities and the 
millions of Down’s syndrome patients across the world also face multiple 
other health issues, including congenital heart defects, haematopoietic dis- 
orders and early-onset Alzheimer’s disease’. Down’s syndrome research- 
ers have sought to define the genes on chromosome 21 most closely 
associated with Down’s syndrome, but this has proven difficult due to high 
genetic complexity and phenotypic variability of Down’s syndrome, con- 
founded by normal variation between individuals’. Despite progress with 
mouse models for Down’s syndrome*”, there remains a need for better 
ways to understand the underlying cell and developmental pathology of 
human Down’s syndrome, key to therapeutic design of any kind’. 

The last decade has seen great advances in strategies to correct 
single-gene defects of rare monogenic disorders, beginning with cells 
in vitro and in several cases advancing to in vivo and clinical trials°. In 
contrast, genetic correction of the over-dose of genes across a whole 
extra chromosome in trisomic cells has remained outside the realm of 
possibility. Our effort was motivated by the idea that functional cor- 
rection of living trisomic cells may be feasible by inserting a single 
gene that can epigenetically silence a whole chromosome. An indu- 
cible system for such ‘trisomy silencing’ would have immediate trans- 
lational relevance as a resource to investigate the cellular pathology 
and gene pathways affected in Down’s syndrome, in a setting free 
from pervasive genetic or epigenetic variation that exists between 
individuals, sub-clones, or even isogenic cell isolates*”*. 

There is a natural mechanism to compensate the difference in dosage 
of X-linked gene copies between mammalian females (XX) and males 
(XY). This is driven by a large (~17 kilobases (kb) in human), non-coding 
RNA, XIST, which is produced exclusively from the inactive X chromosome’, 
and ‘paints’ (accumulates across) the interphase chromosome structure”. 
During early development, the XIST RNA induces numerous hetero-
chromatin modifications and architectural changes which transcrip-
tionally silence the inactive X chromosome and manifest cytologically
as a condensed Barr body (reviewed in refs 11, 12). There is evidence
for some DNA sequence specificity to XIST function, as certain human 
genes escape X-inactivation’’; however, autosomal chromatin has sub- 
stantial capacity to be silenced'*'*. Understanding the full potential of 
an autosome to be silenced, however, requires examination under condi- 
tions that avoid creating a deleterious functional monosomy. The strategy 
pursued here meets that requirement and creates a tractable model to 
study the distinct biology of human chromosome inactivation. 

As outlined in Fig. 1a, we set out to determine whether the human 
X-inactivation gene, XIST, could be inserted into one copy of chro- 
mosome 21, and enact a chromosome-wide change in its epigenetic 
state. We pursued zinc finger nuclease (ZFN)-driven targeted addition” 
of an inducible XIST transgene to the gene-rich core of chromosome 21 
in induced pluripotent stem (iPS) cells derived from a Down’s syn- 
drome patient. If accomplished, this milestone would provide a system 
to study Down’s syndrome cell pathology and the first step towards a 
potential genetic/epigenetic approach to ‘chromosome therapy’.


Insertion of XIST into a trisomic chromosome 21 


Given its large size, neither the XIST gene nor its cDNA has previously 
been integrated in a targeted fashion. Thus, our first goal was to demon- 
strate that ZFNs could accurately insert the largest transgene to date, 
substantially larger than sequences commonly used for genome editing"®. 
We first attempted this with a ~16-kb XIST transgene in a transformed 
cell line (HT1080), using established ZFNs to the AAVS1 locus on chro- 
mosome 19 (ref. 19). This proved highly successful (our unpublished 
data). To extend this to chromosome 21, we engineered ZFNs to a 36- 
base pair (bp) sequence in intron 1 of the DYRK1A locus at chromosome 
21q22 (as in Fig. 1b), and validated their robust activity (Supplementary 
Fig. 1a, b). We tested an even larger (~21 kb) construct containing near
full-length XIST cDNA in HT1080 cells and demonstrated efficient, 
accurate addition to this gene-rich region (Supplementary Fig. 2a, b). 
We next determined whether this was achievable in technically chal-
lenging iPS cells, which have unique therapeutic and developmental
potential to form various cell types, and thus would be important for
any future ex vivo cellular therapy efforts. We used a male Down’s
syndrome iPS cell line” which we confirmed maintains pluripotency
markers and trisomy 21. Although a constitutively transcribed trans-
gene could be used, we engineered an inducible system to maximize
utility for investigating Down’s syndrome biology. In one step, we
integrated both the doxycycline-controlled XIST transgene into chro-
mosome 21 (Fig. 1b and Supplementary Fig. 3a) and a transgene car-
rying the doxycycline control component (rtTA) into the AAVS1
chromosome 19 safe harbour, disruption of which creates no known
adverse effects’? (Supplementary Fig. 3b).

Figure 1 | Genome editing integrates XIST into chromosome 21 in trisomic
iPS cells. a, Concept for translating dosage compensation to trisomy 21.
b, XIST construct (19 kb): two homologous arms and 14-kb XIST cDNA with
inducible pTRE3G promoter. c, DNA/RNA FISH in interphase Down’s
syndrome iPS cells shows that XIST overlaps one of three DYRK1A genes (left
panels and insets) in a non-expressing cell (top, arrows), and a cell induced to
express a large XIST RNA territory over the DYRK1A locus after 3 days in
doxycycline (bottom, arrows). Right panels show green channel (DYRK1A)
alone. Nuclear DNA is stained with 4’,6-diamidino-2-phenylindole (DAPI,
blue). Scale bar, 2 µm. d, OCT4 immunostaining and XIST RNA FISH in a
transgenic colony: highly consistent XIST expression throughout the colony.
Scale bar, 100 µm. e, Metaphase DNA FISH shows one targeted chromosome
21. XIST gene (asterisk and close-up) overlaps one of three DYRK1A genes
(arrows).

We analysed 245 colonies from the pooled transformants by inter- 
phase RNA/DNA fluorescence in situ hybridization (FISH) (Fig. 1c) 
to determine whether XIST was present and overlapped one of three 
DYRK1A alleles. Notably, 98.5% of XIST RNA-positive colonies car-
ried XIST at this location on chromosome 21, and also contained the
rtTA/selection transgene (Supplementary Table 1). Efficiency was
sufficiently high that, through modifications to editing conditions,
we obtained a few sub-clones with XIST integrated into two or even
all three alleles of DYRK1A (see Methods and Supplementary Fig. 3c
and Supplementary Table 1). Six independent sub-clones were chosen
for further study based on: an XIST transgene on one of three chro-
mosome 21 copies, pluripotent colony morphology, OCT4 (also called
POU5F1) staining (Fig. 1d and Supplementary Fig. 4a), and formation
of embryoid bodies. FISH to metaphase chromosomes (Fig. 1e and Sup-
plementary Fig. 3d) and Southern blotting (Supplementary Fig. 1c-e)
confirmed the gene addition accuracy, with 47 chromosomes, for all six 
clones. High-resolution cytogenetic banding and/or array comparative 
genomic hybridization (CGH) on selected clones showed no significant 
abnormalities other than full chromosome 21 trisomy (Supplementary 
Fig. 4c-e). 


XIST RNA induces a chromosome 21 Barr body 


In the panel of six independent genome-edited clones, we induced 
transgene expression and detected XIST RNA by FISH 3 days later. A 
localized XIST RNA ‘territory’ over one chromosome 21 (Fig. 1c) was 
seen in over 85% of cells in all six clones (Fig. 1d and Supplementary 
Fig. 4a, b). This mirrored the unique behaviour of endogenous XIST 
RNA which ‘paints’ the inactive X chromosome nuclear territory’®. 

The natural inactivated X chromosome forms a condensed Barr body 
which carries repressive histone marks"’. Similarly, 5 days after XIST 
induction, the edited chromosome 21 became markedly enriched in all 
heterochromatin marks examined, including H3K27me3, UbH2A and 
H4K20me in 90-100% of cells and, later, with macroH2A (Fig. 2a, b and
Supplementary Fig. 5a). Supplementary Fig. 5b illustrates that H3K27me 
spreads across the whole metaphase chromosome 21. Moreover, chro- 
mosome 21 DNA in many nuclei became notably condensed, further 
evidence that we successfully generated a heterochromatic chromosome 
21 Barr body (Fig. 2c). 


Allele-specific silencing across chromosome 21


To measure overall transcription across the XIST-targeted chro-
mosome 21, we used an approach that we developed to broadly assay
heterogeneous nuclear RNA (hnRNA) expression and to distinguish 
inactive from active X chromosome’’, on the basis of in situ hybrid- 
ization to CoT-1 repeat RNA. This showed that the chromosome 21 
XIST RNA territory was depleted for hnRNA detected by CoT-1 
(Supplementary Fig. 5c), similar to the inactive X chromosome”’. 
We next used multi-colour RNA FISH to determine the presence of 
transcription foci at each allele for six specific chromosome 21 genes, an 
established approach that we earlier showed'®’* discriminates active 
versus silenced genes on inactive X chromosome. Without XIST expres- 
sion, there are three bright transcription foci from each DYRK1A allele 
(Fig. 1c, top), but after XIST expression, the targeted allele becomes
weaker or undetectable, indicating repression of DYRK1A (Fig. 1c,
bottom).

Figure 2 | XIST induces heterochromatin modifications and condensed
chromosome 21 Barr body. a, XIST RNA recruits heterochromatic epigenetic
marks (for example, UbH2A). Channels are separated for cell indicated with an
arrow. Scale bar, 5 µm. b, Percentage of XIST territories with heterochromatin
marks. Mean ± standard error, 100 nuclei in ~5 colonies. c, XIST RNA induces
chromosome 21 Barr body visible by DAPI stain (arrow). Scale bar, 2 µm.

The APP gene on chromosome 21 encodes β-amyloid precursor
protein; mutations in APP which cause accumulation of β-amyloid
lead to early-onset familial Alzheimer’s disease, and APP overexpres- 
sion is linked to the Alzheimer’s disease characteristic of Down’s 
syndrome’. Initially, three bright RNA transcription foci are apparent 
(Fig. 3a, top). Short-term XIST expression resulted in incomplete 
repression of the targeted allele (Fig. 3a, middle), which after 20 days 
was completely silenced, as shown in two independent clones (Fig. 3a, 
bottom, and Fig. 3b). 

We examined four more loci, 3-21 megabases (Mb) from XIST: 
ITSN1, USP25, CXADR and COL18A1. Complete silencing of each allele 
on the edited chromosome 21 was seen in ~100% of cells accumulating
XIST RNA (Fig. 3c, d and Supplementary Fig. 6a). Allele-specific silen- 
cing was further validated using single nucleotide polymorphism (SNP) 
analysis. PCR with reverse transcription (RT-PCR) products for eight 
known polymorphic sites (in four genes) were sequenced (ADAMTS1, 
ETS2, TIAM1 and HSPA13) (Supplementary Fig. 6b, c). Interestingly, 
clones 2 and 3 showed an identical pattern of eight SNP alleles repressed, 
whereas clone 1 showed an alternative pattern of SNPs repressed. As 
summarized in Supplementary Fig. 6c, this chromosome-wide pattern 
allows extrapolation of the haplotype for each of the three chromosome 
21 homologues, and indirectly identifies for each clone which chro- 
mosome 21 was silenced by an XIST transgene. 

We also examined clones carrying XIST on two or all three copies of
chromosome 21 and found that after 20 days in doxycycline, most or 
all cells lost XIST localization or expression, and the targeted chromo- 
somes did not silence the APP gene (Supplementary Fig. 7a, b). Thus, 
there is in vitro selection and epigenetic adaptation to circumvent 
creating a functional monosomy or nullisomy, consistent with obser- 
vations that monosomic cells do not persist in mosaic patients. 


Genome-wide silencing and methylation 

Having demonstrated allele-specific repression for the ten genes exam- 
ined above, we extended this to genome-wide expression profiling. We 
treated three transgenic clones and the parental line with doxycycline 
for 3 weeks, and compared their transcriptomes to parallel cultures 
without XIST transcription, all in triplicate. Only on chromosome 21 
is there overwhelming change, in all three clones (Fig. 4a), with ~95% 
of significantly expressed genes becoming repressed (Supplementary 
Table 2). 
Figure 3 | XIST induces long-range silencing in targeted iPS cells. a, RNA
FISH. APP RNA transcribes from three loci in uninduced cells (day 0), and is
progressively silenced after induction (targeted chromosome 21, arrows). Scale
bar, 2 µm. b, Quantification of APP silencing. Mean ± standard error, 100
nuclei. c, Silencing for four more chromosome-21-linked genes by RNA FISH.
Mean ± standard error from 100 nuclei. d, Long-range silencing of
chromosome 21 genes by XIST RNA. USP25 is ~21 Mb from the XIST
integration site (black arrow).

Dosage compensation corrects chromosome 21 expression to near
normal disomic levels, based on the change in total output of expressed 
genes per chromosome after XIST is induced. Because evidence suggests 
that many chromosome 21 genes are not increased the theoretical 
1.5-fold in trisomy”'”’, we also directly compared trisomic to disomic 
cells. This provides a baseline for evaluating the degree to which chro- 
mosome 21 overexpression is corrected by XIST. After XIST induction, 
overall chromosome 21 expression is reduced by 20%, 15% and 19% for 
clones 1, 2 and 3, respectively; this mirrors very well the 22% reduction 
for disomic iPS cells that lack the third chromosome 21 altogether 
(Fig. 4a). This disomic line is representative, as a similar difference 
(21%) was seen for an isogenic disomic sub-clone that we isolated from 
the trisomic parental iPS cells (not shown). Individual genes repressed 
by XIST are distributed throughout chromosome 21, as are genes over-
expressed in trisomic versus disomic cells (Fig. 4b). In addition, RT-PCR
confirmed repression for individually examined genes (Supplementary 
Fig. 7c). Clearly, XIST induces robust dosage compensation of most chro- 
mosome 21 genes overexpressed in trisomy. 
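
One rough way to read the percentages above is to express each clone's reduction as a share of the trisomic-versus-disomic difference. The short Python sketch below does that arithmetic with the values quoted in the text. The "fraction corrected" ratio and the function names are illustrative assumptions, not a metric defined in the paper, and the equal-contribution expectation is the idealized case that the text notes many genes do not follow.

```python
# Reductions in total chromosome 21 expression quoted in the text (per cent).
XIST_REDUCTION_PERCENT = {"clone 1": 20.0, "clone 2": 15.0, "clone 3": 19.0}
DISOMIC_REDUCTION_PERCENT = 22.0  # disomic iPS line versus trisomic parental line


def fraction_corrected(xist_reduction, disomic_reduction):
    """Share of the trisomic-versus-disomic difference removed by XIST.

    Illustrative ratio only; it is not a statistic reported by the authors.
    """
    if disomic_reduction <= 0:
        raise ValueError("disomic reduction must be positive")
    return xist_reduction / disomic_reduction


def idealized_reduction_percent(copies_before=3, copies_silenced=1):
    """Drop expected if every chromosome copy contributed equally to expression."""
    return 100.0 * copies_silenced / copies_before


for clone, reduction in XIST_REDUCTION_PERCENT.items():
    share = fraction_corrected(reduction, DISOMIC_REDUCTION_PERCENT)
    print(f"{clone}: {reduction:.0f}% reduction, {share:.0%} of the disomic difference")

print(f"idealized 3-to-2 copy reduction: {idealized_reduction_percent():.1f}%")
```

The idealized figure corresponds to the theoretical 1.5-fold overexpression mentioned above. That the disomic line itself shows only a 22% difference is the reason the direct trisomic-versus-disomic comparison is the more appropriate baseline.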

Trisomy 21 may have an impact on genome-wide expression path- 
ways, but differences attributable to trisomy 21 are confounded by 
genetic and epigenetic variability’. This inducible trisomy silencing system 
provides a new foothold into this important question. For example, 
even the three isogenic transgenic sub-clones show many expression 
differences (>1,000), but upon XIST induction, ~200 genes through- 
out the genome change in all three clones (but not the doxycycline- 
treated parental), most probably directly due to chromosome 21 trisomy. 
Therefore, ‘trisomy correction in a dish’ has promise as a means to 
identify genome-wide pathways perturbed by trisomy 21. 

Figure 4 | Genomic expression and methylation reveal widespread silencing
of chromosome 21. a, Microarray: expression difference for three transgenic
clones in doxycycline (dox) versus no doxycycline, compared to disomic line
versus trisomic parental line. Total change in gene expression (n = 3) per
chromosome shows chromosome 21 ‘correction’ near disomic levels, with only
limited changes on other chromosomes. The right y axis is scaled for per cent
gene expression change. Mean ± standard deviation, in triplicate.
b, Distribution of individual gene repression across chromosome 21.
c, Methylation of CpG island promoters. In treated clones, 97% of chromosome
21 genes increased by at least 5% (2-fold greater than average), compared to
none in the parental line. P, parental line; 1, clone 1; 2, clone 2.

In addition to transcriptional silencing, X-inactivation is stabilized
by hypermethylation of promoter CpG islands****, which occurs late
in the silencing process. Therefore, we also examined the promoter
methylome in two genome-edited clones 3 weeks after XIST induction 
and found it largely unaltered, with one striking exception, genes on 
chromosome 21 (P value < 2.2 × 10⁻¹⁶) (Fig. 4c). Here, 97% of CpG-
island-containing genes exhibited a robust increase in promoter DNA
methylation, within the range of that seen for the inactive X chro- 
mosome” (adjusted for active/inactive chromosomes; see Methods). 
This change swept the entire chromosome, with the interesting excep- 
tion of a few genes that ‘escape’ methylation in both clones. 

In summary, data from eight different approaches demonstrate 
impressive competence of most chromosome 21 genes to undergo epi- 
genetic modification and silencing in response to an RNA that evolved 
to silence the X chromosome. 


Phenotypic correction in vitro 


Dosage compensation of chromosome imbalance presents a new 
paradigm, with opportunities to advance Down’s syndrome research 
in multiple directions, including a new means to investigate human 
Down’s syndrome cellular pathologies, which are largely unknown. 
Inducing trisomy silencing in parallel cultures of otherwise identical 
cells may reveal cellular pathologies due to trisomy 21, which could be 
obscured by differences between cell isolates. We examined cell pro- 
liferation and neural rosette formation to look for an impact on cell 
phenotype. 

There is some evidence of proliferative impairment in Down’s 
syndrome brains***; however, we observed that this varied in vitro 
between our Down’s syndrome fibroblast samples, and this would be 
highly sensitive to culture history. A clear answer emerged from com- 
paring identical cell cultures, grown with or without doxycycline for 
1 week. XIST induction in six independent transgenic sub-clones
rapidly and consistently resulted in larger, more numerous and tightly 
packed colonies in just 7 days (Fig. 5a and Supplementary Fig. 8a), 
with 18-34% more cells (Fig. 5b). Doxycycline did not enhance 
growth of the parental Down’s syndrome cells or sub-clone (Fig. 5b 
and Supplementary Fig. 8a). Thus, a proliferative impairment linked 
to chromosome 21 overexpression can be rapidly ameliorated by 
dosage compensation. 

Figure 5 | ‘Trisomy correction’ affects cell proliferation and neurogenesis.
a, One week of XIST expression resulted in larger, more numerous colonies
(representative sample). b, Changes in cell number for parental line (PL),
parental line subclone (PL-s), and six transgenic clones (C1-C6). Mean ± s.e.
(n = 4-6). c, Corrected cultures formed neural rosettes by day 14; trisomic
(parental and non-induced) cultures took longer (17 days). Scale bar, 100 µm.
d, Number of rosettes formed on day 14 and day 17. Mean ± standard error,
10-12 random fields in triplicate. P, parental; C1, clone 1; C3, clone 3.

We next examined differentiation of targeted Down’s syndrome
iPS cells into neural progenitor cells. In 11-12 days after neural induc-
tion of already confluent cultures, all three XIST-expressing cultures
began to form neural rosettes, and in 1-2 days were replete with
neural rosettes (Fig. 5c), a signature of neural progenitors (confirmed
by PAX6 and SOX1 staining) (Supplementary Fig. 8b). Notably, even
at day 14, parallel uninduced cultures remained devoid of rosettes 
(Fig. 5c). Uncorrected cultures required 4-5 more days in neural- 
induction media to fill with neural rosettes of similar size and number, 
which they did on day 17 (Fig. 5d and Supplementary Fig. 8d). There 
was no effect of doxycycline on neurogenesis in the parental line 
(Fig. 5c, d and Supplementary Fig. 8d). This marked delay in neural 
differentiation seems to be primarily independent of cell proliferation 
(Methods). Variability in the kinetics of neural differentiation 
between various iPS cell lines can obscure differences due to trisomy 
21 (ref. 26). We circumvented this using parallel cultures and on- 
demand chromosome 21 silencing, which made clear these important 
phenotypic differences. This highlights the potential of this new 
experimental model to illuminate cellular pathologies directly attri- 
butable to chromosome 21 overexpression in iPS cells and their dif- 
ferentiated progeny. 


Towards future applications 


The Supplementary Information summarizes two significant points 
relevant to potential applications and therapeutic strategies. First, we 
show that heterochromatic silencing is stably maintained, even upon 
removal of doxycycline and XIST expression (Supplementary Fig. 9a, b), 
consistent with previous studies*’. Second, although not investigated 
extensively, we targeted XIST in non-immortalized fibroblasts from a 
female Down’s syndrome patient, which generated many cells carrying 
XIST (and some heterochromatin marks) on chromosome 21 (Sup- 
plementary Fig. 9c, d). Finally, we note that our XIST transgene lacks 
X-chromosome ‘counting’ sequences, and thus is compatible with nat- 
ural female X inactivation. 


Discussion 


We set out to bridge the basic biology of X-chromosome dosage 
compensation with the pathology of chromosomal dosage disorders, 
particularly Down’s syndrome. In so doing, the present work yields 
advances that have an impact on three important areas: one basic and 
two translational. 

Although not our primary focus here, a significant impact of this 
work is that we have created a tractable, inducible system to study 
human chromosome silencing. Importantly, unlike random integration 
into a diploid cell, silencing a trisomic autosome avoids selection against 
full autosomal silencing, and this demonstrated remarkably robust 
competence of chromosome 21 to be silenced. Thus, XIST RNA evolved 
for the X chromosome uses epigenome-wide mechanisms”. The ability 
to insert a single XIST transgene in any locus provides a more powerful 
tool to study XIST function, and our effort also almost triples the size of 
transgenes that can be thus targeted for a host of other applications. 

From a translational perspective, trisomy silencing has immediate 
impact as a new means to define the poorly understood cellular path- 
ways deregulated in Down’s syndrome, and creates the opportunity to 
derive and study various patient-compatible cell types potentially 
relevant to Down’s syndrome therapeutics. Inducible trisomy silen- 
cing in vitro compares otherwise identical cultures, allowing greater 
discrimination of differences directly due to chromosome 21 over- 
expression distinct from genetic and epigenetic variation between 
transgenic sub-clones, or potentially even rare disomic sub-clones 
isolated from a trisomic population (refs 27, 28 and this study). XIST 
expression triggers not only chromosome 21 repression, but a defined 
effect on the genomic expression profile, and reverses deficits in cell 
proliferation and neural progenitors, which has implications for hypo- 
cellularity in the Down’s syndrome brain*”. This new approach can 
illuminate the cohort of genes and cognate pathways most consistently 
impacted in Down’s syndrome, to inform the search for drugs that may 
rebalance those pathways and cell pathologies. This general strategy 
could be extended to study other chromosomal disorders, such as 
trisomy 13 and 18, often fatal in the first 1-2 years. 

Finally, the more forward-looking implication of this work is to
bring Down’s syndrome into the realm of consideration for future 
gene therapy research. Although development of any clinical gene 
therapy is a multi-step process, any prospect requires that the first 
step, functional correction of the underlying genetic defect in living 
cells, is achievable. We have demonstrated that this step is no longer 
insurmountable for chromosomal imbalance in Down’s syndrome. 
Our hope is that for individuals and families living with Down’s 
syndrome, the proof-of-principle demonstrated here initiates mul- 
tiple new avenues of translational relevance for the 50 years of 
advances in basic X-chromosome biology. 


METHODS SUMMARY 


ZFNs against the DYRK1A locus on chromosome 21 were designed and validated
by established procedures'*. These and previously identified ZFNs to chromosome
19 AAVS1 (ref. 19) were used to deliver the XIST gene and rtTA/puro
to chromosome 21 and chromosome 19, respectively. All constructs were simul- 
taneously electroporated into a Down’s syndrome iPS cell line (DS1-iPS4) (G. Q. 
Daley, Children’s Hospital Boston)*°. Over 100 clones were isolated and 6 chosen 
for more analysis. Silencing of the targeted chromosome 21 was demonstrated by 
eight different approaches as detailed in Methods (RNA microarray, DNA methy- 
lation array, RNA FISH to chromosome 21 genes, heterochromatin hallmarks, 
qRT-PCR, gene SNP analysis, Barr body formation, RNA FISH to hnRNA). 
Down’s syndrome iPS cells were assessed for phenotypic differences (proliferation 
and neural rosette formation) before and after trisomy correction, as detailed in 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS

Cell culture. HT1080 TetR cells (Invitrogen) and female Down’s syndrome 
human primary fibroblast line (Coriell) (AG13902) were cultured as recom- 
mended by the supplier. Down’s syndrome iPS cell parental line (DS1-iPS4) was 
provided by G. Q. Daley (Children’s Hospital Boston)” and maintained on irra- 
diated mouse embryonic fibroblasts (iMEFs) (R&D Systems, PSC001) in hiPSC 
medium containing DMEM/F12 supplemented with 20% knockout serum replacement
(Invitrogen), 1 mM glutamine (Invitrogen), 100 µM non-essential amino
acids (Invitrogen), 100 µM β-mercaptoethanol (Sigma) and 10 ng ml⁻¹ FGF-β
(Invitrogen, PHG0024). Cultures were passaged every 5-7 days with 1 mg ml⁻¹
of collagenase type IV (Invitrogen).

ZFN design. ZFNs against the human AAVS1 locus on chromosome 19 have
been previously described’. ZFNs against the DYRK1A locus were designed using 
an archive of pre-validated zinc finger modules'*”’, and validated for genome 
editing activity by transfection into K562 cells and Surveyor endonuclease-based 
measurement of endogenous locus disruption (“Cell’*°*') exactly as described”. 
Southern blotting for targeted gene addition was performed exactly as described'””* 
on SphI-digested genomic DNA probed with a fragment corresponding to positions 
Chr21:38825803-38826056 (hg19).

XIST and rtTA/puro plasmid construction. Fourteen-kilobase human XIST 
cDNA, a splicing isoform of full-length XIST cDNA, was subcloned into pTRE3G 
(Clontech, catalogue no. 631167). Two homologous arms (left arm, 690 bp; right 
arm, 508 bp) of the DYRK1A gene on chromosome 21 were amplified by PCR from
primary Down’s syndrome fibroblasts (AG13902) (Coriell) and cloned into the
pTRE3G vector (human chromosome 21 DYRK1A left arm primers: forward
5’-GCCGTATACCATTAACTCTTTACTGTTC-3’, reverse
5’-TCTGTATACGTAAACTGGCAAAGGGGTGG-3’; human chromosome 21 DYRK1A right arm
primers: forward 5’-ATTTCGCGAACGGGTGATGAGCAGGCTGT-3’, reverse
5’-CCGTCGCGAAAACCAGAAAGTATTCTCAG-3’). The pEF1α-3G rtTA-pA
cassette from the pEF1α-Tet3G vector (Clontech) was subcloned into a plasmid for
targeted gene addition to the PPP1R12C/AAVS1 locus", which contains a unique
HindIII site flanked by two 800-bp stretches of homology to the ZFN-specified 
position in the genome. 

Dual-targeted addition of human Down’s syndrome iPS cells and generation 
of stable targeted clones. The Down’s syndrome iPS cell line was cultured in 
10 µM of Rho-associated protein kinase (ROCK) inhibitor (Calbiochem;
Y27632) 24h before electroporation. Single cells (1 X 10”) were collected using 
TrypLE Select (Invitrogen), re-suspended in 1× PBS and electroporated with a
total of 55 µg DNA including five plasmids (XIST, DYRK1A ZFN1, DYRK1A
ZFN2, rtTA/puro and AAVS1 ZFN) with both 3:1 and 5:1 ratios of XIST:rtTA/puro.
The electroporation conditions were 220 V and 750 µF (BioRad Gene
Pulser II System). Cells were subsequently plated on puromycin-resistant DR4 
MEF feeders (Open Biosystems, catalogue no. MES3948) in hiPSC medium sup- 
plemented with ROCK inhibitor for the first 24h. Over 300 colonies remained 
after 12 days of 0.4 µg ml⁻¹ puromycin selection and 245 randomly chosen individual
colonies across 36 pooled wells were examined by interphase DNA/RNA
FISH for the presence and expression of XIST, correct targeting and retention of 
trisomy (because some subclones lacked XIST or showed just two DYRK1A DNA 
signals). Over 100 individual clones were isolated and characterized, and those of 
interest, containing targeted XIST on one of three DYRK1A loci, were frozen. Six 
single target clones with good pluripotent morphology, OCT4 positive staining, 
correct targeting to one trisomic chromosome, and good XIST RNA paint were 
expanded for further characterization. One double and one triple target line, two 
non-target clones, and one disomic clone were also isolated and frozen. Targeting 
and correct chromosome number (47) was confirmed by interphase and meta- 
phase FISH and genome integrity was confirmed by high-resolution G-band 
karyotype and CGH array. 

Chromosome preparation. iPS cells were treated with 100 ng ml⁻¹ KaryoMAX
colcemid (Invitrogen) for 2-4 h at 37 °C in a 5% CO2 incubator. Cells were
trypsinized, treated with hypotonic solution, and fixed with methanol:acetic acid 
(3:1). Metaphases were spread on microscope slides, and at least 20 analysed per 
clone. Karyotype analysis was done on pro-metaphase chromosomes using 
standard Giemsa-trypsin G-band methods.

CGH array. CGH was performed in the Cytogenetics Laboratory at University of 
Massachusetts Medical School. Genomic Microarray analysis using University of 
Massachusetts Genomic Microarray platform (Human Genome Build hg19) was 
performed with 1 µg of DNA. The array contains approximately 180,000 oligonucleotides
(60-mers) that represent coding and non-coding human sequences
and high-density coverage for clinically relevant deletion/duplication syndromes 
and the telomeric and pericentromeric regions of the genome. Data were analysed 
by BlueFuse Multi, v3.1 (BlueGnome, Ltd). 

DNA/RNA FISH and immunostaining. DNA and RNA FISH were carried out 
as previously described'®'*'***. The XIST probe is a cloned 14-kb XIST cDNA 
(the same sequence as XIST transgene in Fig. 1b) in pGEM-7Zf(+) (Promega).
Six chromosome 21 gene probes are BACs from BACPAC Resources (DYRK1A,
RP11-105O24; APP, RP11-910G8; USP25, RP11-840D8; CXADR, RP11-1150I14;
ITSN1, RP11-1033C16; COL18A1, RP11-867O18). DNA probes were labelled by
nick translation with either biotin-11-dUTP or digoxigenin-16-dUTP (Roche). In 
simultaneous DNA/RNA FISH (interphase targeting assay), cellular DNA was 
denatured and hybridization performed without eliminating RNA and also 
treated with 2 U µl⁻¹ of RNasin Plus RNase inhibitor (Promega). For immunostaining
with RNA FISH, cells were immunostained first with RNasin Plus and
fixed in 4% paraformaldehyde before RNA FISH. Antibodies were as follows: 
H3K27me3 (Millipore, 07-449), UbH2A (Cell Signaling, 8240), H4K20me 
(Abcam, ab9051), macroH2A (Millipore, 07-219), OCT4 (Santa Cruz, sc-9081), 
PAX6 (Stemgent, 09-0075), SOX1 (R&D Systems, AF3369). 

Allele-specific SNP analysis. Primers were designed to amplify 3’ untranslated 
regions of chromosome 21 genes reported to contain SNPs (Supplementary Table 
3). Total cDNA was used from three transgenic clones with and without XIST 
induction for 22 days. RT-PCR products were sequenced by GENEWIZ. Of ~10
genes examined, four were heterozygous and informative in the patient Down’s 
syndrome iPS cell line used here. 

Microarray analysis. Three independently targeted subclones plus the parental 
chromosome 21 trisomic (non-targeted) iPS cell line were grown with or without 
doxycycline (2 µg ml⁻¹) for 22 d. Normal male iPS cell and disomic isogenic lines
were also cultured for 22 d and total RNA was extracted with a High Pure RNA 
extraction kit (Roche) in triplicate for each, processed with a Gene Chip 3’ IVT 
express kit (Affymetrix), and hybridized to Affymetrix human gene expression 
PrimeView arrays. Array normalization was performed with Affymetrix 
Expression Console Software with Robust Multichip Analysis (RMA)**. Probe 
sets with the top 60% of signal values were considered present and ‘expressed’ and 
were used for all further analysis. Data in Fig. 4 has no other threshold applied. 
When designated, a gene expression change significance threshold was applied 
using a two-tailed t-test comparing samples with or without doxycycline in trip- 
licate (n = 3) (Supplementary Table 2, P< 0.01). For the ~200 genes found to 
significantly change in all three clones (in text), a t-test with P<0.001 was 
applied. 

Microarray data interpretation. Using extraction-based methods, changes on 
just one of three alleles (from the XIST-bearing chromosome) will be diluted by 
the other two. If all three chromosomes are fully expressed, this would predict a 
33% reduction in chromosome 21 expression levels per cell when one chro- 
mosome 21 is fully silenced. However, 33% would apply only if chromosome 
21 genes are fully overexpressed to start, and previous evidence and results in this 
study show this is not the case for many genes. Previous microarray studies have 
analysed expression levels of chromosome 21 in Down’s syndrome patient cells, 
although such analyses are hampered by the extensive genetic and epigenetic 
differences between any two individuals’. The fraction of chromosome 21 genes 
detected as overexpressed varies with the study and tissue, but generally is in the 
19-36% range*”’, with individual gene increases often in the ~1.2-1.4 range (less 
than the theoretical 1.5). For example, one study of Down’s syndrome embryoid 
bodies showed that only 6-15% of genes appeared significantly upregulated, but 
this was comparing non-isogenic samples of different ES cell isolates”. 

Our trisomy correction system allows direct comparison of the same cells 
grown in identical parallel cultures, with and without XIST-mediated chro- 
mosome silencing. Our data show a ~20% reduction in chromosome 21 express- 
ion overall; importantly, this level of reduction is seen either when the third 
chromosome is silenced in trisomic cells, or when disomic and trisomic cells 
are compared. This 20% reduction represents an average per cell for all three 
chromosomes, but corresponds to a 60% reduction in expression for just one 
chromosome 21 (the one silenced by XIST RNA, as shown here). 
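
The dilution arithmetic in this section can be checked with a few lines of Python. This is an illustrative sketch only: the function names are hypothetical, the inputs are the round figures quoted in the text, and it assumes that every copy of chromosome 21 contributes equally to the measured signal before silencing.

```python
def reduction_if_one_copy_silenced(n_copies: int = 3) -> float:
    """Fractional drop in total signal if one of n equally expressed copies is fully silenced."""
    return 1.0 / n_copies


def reduction_to_disomic_level(fold_over_disomic: float) -> float:
    """Fractional drop needed to bring a gene expressed at `fold_over_disomic`
    (trisomic relative to disomic) back to the disomic level of 1.0."""
    return 1.0 - 1.0 / fold_over_disomic


def single_copy_reduction(average_reduction: float, n_copies: int = 3) -> float:
    """Reduction attributable to the one silenced copy, given the average
    reduction measured over n copies (assumes equal contribution per copy)."""
    return average_reduction * n_copies


if __name__ == "__main__":
    # Theoretical case: three fully expressed copies, one silenced.
    print(f"one of three copies silenced: {reduction_if_one_copy_silenced():.1%}")
    # Fold increases quoted in the text (1.2-1.4) versus the theoretical 1.5.
    for fold in (1.2, 1.4, 1.5):
        print(f"fold {fold}: reduction to disomic level {reduction_to_disomic_level(fold):.1%}")
    # A ~20% average reduction expressed relative to a single chromosome.
    print(f"single-copy equivalent of 20%: {single_copy_reduction(0.20):.0%}")
```

Under these assumptions, a gene overexpressed 1.2 to 1.4 fold needs only a 17 to 29% reduction to return to the disomic level, which brackets the ~20% overall reduction reported here.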

Apart from our goal here of trisomy dosage compensation, these results add 
significantly to understanding the extent of chromosome 21 overexpression in 
Down’s syndrome, by providing a more comprehensive analysis that shows that 
expression of most genes is increased, but less than the theoretical 1.5 fold. 

qRT-PCR. qRT-PCR was performed for eight downregulated chromosome 21
genes determined by microarray on a Bio-Rad MyiQ real-time PCR detection
system in triplicate for clone 3 with/without doxycycline treatment for 22 d. The
β-actin gene was used as an internal standard for calculation of expression levels.
Primers for eight chromosome 21 genes and β-actin are described in
Supplementary Table 4.

DNA methylation analysis. The parental line and two independent targeted lines 
were grown with and without doxycycline for 22 d, in duplicate cultures. Genomic 
DNA was extracted using PureLink Genomic DNA mini kit (Invitrogen) and 
750 ng bisulphite modified with the Alternative Incubation Conditions from the 
EZ DNA methylation kit (Zymo Research). 160 ng of bisulphite DNA was amp- 
lified, fragmented and hybridized to Illumina Infinium HumanMethylation450 
array following the standard protocol as outlined in the user guide. CpG islands
were defined as high and intermediate CpG densities using the CpG density 
classifications based on those used previously’. The program CpGIE was used 
to locate HC and IC islands on the X chromosome and chromosomes 21 and 22. 
When multiple probes in CpG islands were associated with the same TSS, an 
average genic methylation value was calculated. These average genic values were 
compared before and after doxycycline induction using the Mann-Whitney 
U-test. Analysis was based on CpG islands within promoters of 143 chromosome 
21 genes (Fig. 4c). 
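
The per-gene averaging step described above could be sketched as follows. This is a minimal illustration, not the pipeline used in the study: the function name, the input layout (pairs of TSS identifier and methylation value) and the example values are all hypothetical, and the subsequent Mann-Whitney comparison is not shown.

```python
from collections import defaultdict
from statistics import mean


def average_by_tss(probes):
    """Average methylation values of CpG-island probes that share a TSS.

    `probes` is an iterable of (tss_id, methylation_value) pairs; the layout
    is an assumption made for this sketch.
    """
    grouped = defaultdict(list)
    for tss_id, value in probes:
        grouped[tss_id].append(value)
    # One average "genic" value per TSS, as described in the text.
    return {tss_id: mean(values) for tss_id, values in grouped.items()}


if __name__ == "__main__":
    # Hypothetical probe values, for illustration only.
    example = [("GENE_A", 0.25), ("GENE_A", 0.75), ("GENE_B", 0.10)]
    print(average_by_tss(example))
```

The resulting per-gene values, computed separately for cultures with and without doxycycline, are what a rank-based test would then compare.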

The average methylation value was 6% on chromosome 21 before XIST induc- 
tion, and increased to 20-21% in both subclones after induction. Because any 
methylation increase on the transgenic chromosome would be diluted by the 
presence of three chromosome 21 copies, this suggests the range of 60% methyla- 
tion on the one XIST-coated chromosome, which is within the range seen for the 
inactive X chromosome”. 
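
The dilution estimate above can be reproduced with a short calculation. The sketch below is illustrative and the function name is hypothetical. The simplest reading multiplies the post-induction average by the number of copies; the optional `baseline` argument is an added assumption, not something stated in the text, and shows how the estimate shifts if the two unmodified copies are taken to remain at the pre-induction level.

```python
def single_copy_methylation(avg_after: float, n_copies: int = 3,
                            baseline: float = 0.0) -> float:
    """Methylation on the one modified copy, given the average over n copies.

    Solves (x + (n - 1) * baseline) / n = avg_after for x, where `baseline`
    is the assumed methylation level of the other n - 1 copies.
    """
    return n_copies * avg_after - (n_copies - 1) * baseline


if __name__ == "__main__":
    for avg in (0.20, 0.21):
        simple = single_copy_methylation(avg)
        corrected = single_copy_methylation(avg, baseline=0.06)
        print(f"average {avg:.0%}: x3 estimate {simple:.0%}, "
              f"with 6% baseline on the other copies {corrected:.0%}")
```

With no baseline correction the estimate is 60 to 63%; holding the other two copies at 6% gives roughly 48 to 51%. Both are of the same order as the figure quoted in the text.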

Cell proliferation analysis. Eight different iPS cell lines (parental line, one non- 
targeted subclone, and six independent targeted subclones) were passaged onto 
6-well plates at equal cell densities per well of each line and grown with or without 
doxycycline for 7d. At least four replicates of each line were analysed in two 
independent experiments. Rigorous measures were taken to minimize and con- 
trol for any minor variations in seeding densities of iPS cells, which cannot be 
plated as single cell suspensions. First, the analysis was done twice for six different 
transgenic clones, in each case comparing triplicate plates of corrected versus not 
corrected (doxycycline versus no doxycycline). To avoid differences in plating 
efficiencies of doxycycline and no doxycycline cells, we performed the experi- 
ments over a time course that did not require passage. For each of the six trans- 
genic clones, the parental line and one negative control (non-targeted) subclone, a 
single well of Down’s syndrome iPS cells (without doxycycline) was used to 
generate a cell suspension (cells and small disaggregated clumps). Next, equal 
aliquots of the cell suspension were plated into each of six wells four times (not 
relying on one measurement but the average of four for seeding each well). After 
plating, doxycycline was added to three of the six wells, and the cultures were 
maintained for 7 d. For images, plates were fixed, stained with 1 mg ml⁻¹ crystal
violet (Sigma) in 70% ethanol for 30 min and scanned to generate TIFF images.
For cell counts, single cells were collected by TrypLE Select and counted using
Beckman Coulter Z1 Particle Counter. 

Differentiation of neural progenitors and irreversibility in cortical neurons. 
For differentiation, independent XIST-transgenic iPS cell clones and the parental 
Down’s syndrome iPS cell line were dissociated with Accutase (Innovative Cell 
Technologies) and 4 X 10° single cells were plated on Matrigel-coated 6-well 
plates in mTeSR1 medium (Stemcell technologies). Once the cell culture reached 
90-100% confluence, neural induction was initiated by changing the culture 
medium to neural induction medium, a 1:1 mixture of N2- and B27-containing 
media supplemented with 500 ng ml⁻¹ noggin (R&D Systems), 10 µM SB431542
(Tocris Bioscience), and 1 µM retinoic acid (Sigma, catalogue no. R2625), with/
without treatment of doxycycline for the specified times. The neural rosettes were 
counted and their diameter measured for at least 300 rosettes (sampled in random 
areas from triplicate dishes). At day 14, the doxycycline-induced culture had an 
average rosette diameter of 142 µm ± 0.55 µm in clone 1 and 141 µm ± 3.49 µm
in clone 3. Rosettes could not be measured at the same time point in the uncorrected
culture, as they had not formed. At day 17, the uncorrected culture had
neural rosettes of similar number and size for both clones 1 (140 µm ± 0.87 µm)
and 3 (140 µm ± 1.09 µm). The corrected culture could not be accurately compared
for day 17 because the rosettes had become so mature and often had
merged. After 17 d, neural rosettes were collected by dissociation with dispase 
and replated on poly-ornithine and laminin-coated plastic dishes in N2- and B27-containing
media including 20 ng ml⁻¹ FGF2. After a further 2 d, FGF2 was
withdrawn to promote differentiation of cortical neurons. To test for the irre- 
versibility of silencing, two independent clones were differentiated to cortical 
neurons in the presence of doxycycline for 70 days to initiate silencing. They were 
then split into parallel cultures grown with and without doxycycline for another 
30 days, and XIST and APP expression analysed by RNA FISH. 

Targeted addition to primary fibroblasts. We used non-immortalized primary 
human female Down’s syndrome fibroblasts, which like all primary fibroblasts 
have a limited lifespan in culture (potentially more limited for Down’s syndrome 
fibroblasts). We reasoned that the robustness of ZFN-driven editing, combined 
with reduction to disomy for the DYRK1A gene, may make it possible to observe
some edited cells before they senesce. We used a transgene carrying near full-length
(~14 kb) XIST cDNA under a TetO2 inducible promoter, and a selectable
marker on the same construct, with ~600-bp homology arms to the DYRK1A 
gene (vector is ~21 kb, with a ~17-kb insert) (data not shown). When introduced 
without the Tet-repressor construct, the TetO2 CMV promoter is constitutively 
active. Two ZFN-containing vectors and the 21-kb XIST transgene were trans- 
fected into primary DS fibroblasts (Coriell AG13902) using Stemfect polymer 
(Stemgent) (10:1 ratio of XIST to ZFN, and 13 µg DNA to 1.3 µl Stemfect per well
of 6-well plate). The frequency of stable integrants was such that a sparse mono- 
layer of transgenic fibroblasts emerged, rather than a few individual colonies 
following selection with hygromycin (75 µg ml⁻¹). The pooled population of selected
cells was analysed by FISH and immunostaining for targeting, XIST expression and 
heterochromatin marks. XIST RNA was observed over the DYRK1A locus in 
~74% of cells, indicating accurate transgene targeting, which was also verified 
by metaphase FISH (Supplementary Fig. 9c). In many cells there was notable 
enrichment of H3K27me, UbH2A and H4K20me heterochromatic marks 
(Supplementary Fig. 9d). Owing to the limited lifespan of primary cells and the 
progressive silencing of the CMV promoter used in this construct, these cells were 
not more fully characterized. 


Coordination of bacterial proteome with
metabolism by cyclic AMP signalling 


Conghui You, Hiroyuki Okano, Sheng Hui, Zhongge Zhang, Minsu Kim, Carl W. Gunderson, Yi-Ping Wang,
Peter Lenz, Dalai Yan & Terence Hwa


The cyclic AMP (cAMP)-dependent catabolite repression effect in Escherichia coli is among the most intensely studied 
regulatory processes in biology. However, the physiological function(s) of cAMP signalling and its molecular triggers 
remain elusive. Here we use a quantitative physiological approach to show that cAMP signalling tightly coordinates the 
expression of catabolic proteins with biosynthetic and ribosomal proteins, in accordance with the cellular metabolic needs 
during exponential growth. The expression of carbon catabolic genes increased linearly with decreasing growth rates upon 
limitation of carbon influx, but decreased linearly with decreasing growth rate upon limitation of nitrogen or sulphur influx. 
In contrast, the expression of biosynthetic genes showed the opposite linear growth-rate dependence as the catabolic genes. A 
coarse-grained mathematical model provides a quantitative framework for understanding and predicting gene expression 
responses to catabolic and anabolic limitations. A scheme of integral feedback control featuring the inhibition of cAMP 
signalling by metabolic precursors is proposed and validated. These results reveal a key physiological role of cAMP- 
dependent catabolite repression: to ensure that proteomic resources are spent on distinct metabolic sectors as needed in 
different nutrient environments. Our findings underscore the power of quantitative physiology in unravelling the underlying
functions of complex molecular signalling networks.


Biological organisms use a myriad of signalling pathways to monitor 
the environment and adjust their genetic programs in accordance 
with environmental changes'*. Understanding how the signalling 
systems perceive the environment and orchestrate the downstream 
genetic changes is a grand challenge of systems biology**. One of the 
earliest signalling systems ever discovered in modern biology is the 
cyclic AMP (cAMP)-dependent pathway in E. coli”’°. This system is 
known to mediate carbon catabolite repression (CCR)"', a ubiquitous 
phenomenon among microorganisms whereby the synthesis of cata- 
bolic proteins is inhibited when growing on glucose or other rapidly 
metabolizable sugars’*’’. In E. coli, it was long established that the 
uptake of glucose inhibited the synthesis of cAMP”, which is required 
for the expression of many catabolic genes through its activation of 
the pleiotropic regulator Crp (the cAMP receptor protein)™*. 
Despite extensive studies on this cAMP signalling pathway, a num- 
ber of basic issues remain unanswered even for this well-known system: 
although the inhibitory effect of glucose uptake on cAMP synthesis 
via the phosphotransferase system (PTS) is well established'*'® (Sup- 
plementary Fig. 1), the growth of E. coli cells on various PTS-independent 
sugars also showed reduced cAMP levels'”"'’. Moreover, limitation of 
nitrogen, phosphorous, and other elements also led to much reduced 
cAMP levels*””’. Elucidating the physiological signal(s) that triggered 
cAMP signalling in those conditions was in fact the pursuit of Monod’s 
last paper”’, but these signal(s) remain elusive. Also, it is unclear to 
what extent cAMP signalling is intended for implementing CCR, as 
hierarchical carbon usage”’, a behaviour widely associated with CCR”, 
was shown to be independent of cAMP in several studies**”’”. Thus, the 
true physiological function of cAMP signalling in E. coli remains open 
nearly 50 years after its discovery. In this study, we describe a top-down 
approach which first addresses quantitatively the physiological function
of cAMP signalling: not for CCR per se, as we will show, but for the
coordination of metabolism with proteomic resource allocation. This
knowledge is then used as a guide to reveal the signalling strategy and 
mechanism by which E. coli cells trigger cAMP signalling. 


Catabolic genes show linear response 

To characterize the physiological consequences of cAMP-dependent 
signalling, we used the well-studied lac system of E. coli, one of the 
many catabolic operons activated by the Crp-cAMP complex’***. 
Wild-type E. coli K-12 cells were grown in minimal medium batch 
culture with saturating amount of one of many carbon sources, with 
the Lac repressor (LacI) deactivated by the inducer isopropyl-β-D-thiogalactoside
(IPTG). As such, the native LacZ expression indicated
the degree of cAMP signalling”. Supplementary Table 1 lists the LacZ
expression level (L) together with the growth rate (λ) for cells growing
in each medium. A scatter plot of the data shows a striking linear
relation (solid circles in Fig. 1a). The same relation is obtained by
limiting the carbon influx, whether by titrating the lactose permease 
for cells growing on lactose (Supplementary Table 2, solid triangles in 
Fig. 1a), or by titrating the glycerol uptake system for cells growing on 
glycerol (Supplementary Table 3, solid diamonds in Fig. 1a); see
Supplementary Figs 2, 3 and Supplementary Table 4 for strain details. 
The red solid line is the best-fit of all data to the form 


$$L = L_{\max}\left(1 - \lambda/\lambda_C\right) \tag{1}$$


referred to below as the ‘C-line’; see Supplementary Table 5 for fitted
values of the x-intercept λ_C and y-intercept L_max. Similar linear relations
are seen (Supplementary Fig. 4) for lac in various other strains of E. coli

Figure 1 | Catabolic and biosynthetic gene expression under nutrient
limitations. For clarity, growth conditions and depository of data are
summarized in the legend table. Numbers in parentheses indicate the
Supplementary Tables containing the parameters of the best-fit lines.
a, Correlation of PlacZ expression with the growth rate under C-limitations
(solid symbols) and NC-limitations (open symbols). The left y-axis shows LacZ
expression per OD600 (Miller unit or MU), which is proportional to expression per total
protein; see Supplementary Methods and Supplementary Fig. 14. The right
y-axis shows the equivalence in normalized fractions φ_C (Supplementary Note 3).
b, Internal cAMP level as indicated by the cAMP excretion rate (Supplementary
Fig. 11) for cells grown under C- and NC-limitations (filled and open symbols,
respectively). c, RNA/protein ratio for both C- and NC-limited growth
(filled and open symbols, respectively). The right y-axis shows the equivalence in
normalized fractions φ_R (Supplementary Note 3). d, PglnA-lacZ expression under
C- and A-limited growth (filled and open symbols, respectively). The right y-axis
shows the equivalence in normalized fractions φ_A (Supplementary Note 3).


characterized here and previously”, and for different catabolic promoters
(solid symbols in Supplementary Fig. 5).
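
A fit to the linear form of equation (1), L = L_max(1 - λ/λ_C), can be sketched with ordinary least squares. This is an illustration under stated assumptions rather than the fitting procedure used in the study: the function name is hypothetical, the data points are synthetic placeholders (not values from the Supplementary Tables), and all points are weighted equally.

```python
def fit_c_line(growth_rates, expression):
    """Least-squares fit of L = L_max * (1 - lam / lam_C).

    The model is a straight line L = a + b * lam with a = L_max and
    b = -L_max / lam_C, so L_max = a and lam_C = -a / b.
    Returns (L_max, lam_C).
    """
    n = len(growth_rates)
    if n < 2 or n != len(expression):
        raise ValueError("need at least two matched (growth rate, expression) points")
    mean_x = sum(growth_rates) / n
    mean_y = sum(expression) / n
    sxx = sum((x - mean_x) ** 2 for x in growth_rates)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(growth_rates, expression))
    if sxx == 0 or sxy == 0:
        raise ValueError("growth rates must vary and the slope must be non-zero")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, -intercept / slope


if __name__ == "__main__":
    # Synthetic, noise-free points generated from assumed parameters.
    true_l_max, true_lam_c = 100.0, 1.2
    lams = [0.2, 0.4, 0.6, 0.8, 1.0]
    values = [true_l_max * (1 - lam / true_lam_c) for lam in lams]
    l_max, lam_c = fit_c_line(lams, values)
    print(f"L_max = {l_max:.2f}, lambda_C = {lam_c:.2f}")
```

Reading the intercepts off the fitted line in this way mirrors how the x-intercept and y-intercept are defined in the text; with real data, measurement noise and unequal uncertainties would need to be handled.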

The data above indicate that the C-line is a common response pat- 
tern shown by the catabolic genes—a gradual increase upon reduction 
in carbon influx or for less rapidly metabolized carbon compounds. 
We refer to such a carbon-dependent mode of growth limitation as
‘C-limitation’. To show that the C-line is a response specific to changes in
carbon catabolism rather than to general growth parameters***’, we 
modulated cell growth by various non-carbon (NC) modes of growth 
limitation, including continuous culture with ammonium or sulphate 
limitation (Supplementary Table 6), and batch culture using a titrata- 
ble nitrogen uptake system (Supplementary Table 7). The latter system 
bypasses the highly regulated nitrogen response system™, creating a 
situation in which the formation of amino acids from their respective 
carbon can be directly titrated by the expression of glutamate dehy- 
drogenase (GDH); see Supplementary Fig. 6. The expression of the 
lacZ promoter (PlacZ) obtained for these modes of NC-limited growth 
is plotted against the growth rates in Fig. 1a and Supplementary Fig. 7 
for several exemplar carbon sources (open symbols). In each case, a
positive linear relationship was observed (dashed lines). Similar rela- 
tions are seen for several other catabolic promoters (open symbols in 
Supplementary Fig. 5d, e). We refer to these positive linear relations as 
the ‘NC-lines’. Both C- and NC-lines also described the expression of a 
PlacZ-gfp fusion reporter in wild-type cells grown in microfluidic 
devices with rapid medium exchange” to maintain low nutrient levels 
and remove excretion products (Supplementary Fig. 8). 

We next examined the dependence of the C- and NC-lines on Crp- 
cAMP-mediated activation. Upon replacing the lacZ promoter by 
the lacUV5 promoter, whose independence of Crp-cAMP is well
established**, LacZ expression in these strains showed little growth- 
rate dependence under both C- and NC-limited growth (Supplemen- 
tary Fig. 9). Additionally, we characterized the expression of a synthetic 
lacZ promoter with the Crp site scrambled, and found only a weak 
growth-rate dependence, resembling that previously reported for con- 
stitutive gene expression*’, under both C- and NC-limitations (Sup- 
plementary Fig. 10). Thus both the C- and NC-lines rely completely on 
Crp-cAMP-mediated gene regulation. 

We also characterized the cAMP excretion rate (Supplementary 
Fig. 11), taken to reflect the internal cAMP concentration”. It is seen 
to correlate well with PlacZ expression in the respective medium for 
both C- and NC-limited growth (Supplementary Fig. 12, Supplementary
Tables 8 and 9). Moreover, a scatter plot of cAMP excretion rates
against the corresponding growth rates (Fig. 1b) resembles the C- and 
NC-lines of PlacZ (Fig. 1a). Together, these data strongly indicate that 
the C- and NC-lines are two sides of the same coin, both regulated by 
Crp-cAMP primarily through cAMP signalling. 


Opposing linear response by anabolic genes 

To understand the linearity of catabolic gene expression with the 
growth rate, we recall a linear relation between the cellular ribosome 
content and the growth rate for exponentially growing bacteria**?”**. 
This relation (a bacterial growth law), which arises due to the alloca- 
tion of ribosomes as a key growth-limiting resource**”*, is verified for 
our strains by characterizing the RNA/protein ratio (r) under both C- 
and NC-limited growth (Supplementary Tables 10 and 11, Fig. 1c). 
The striking contrast between the linear increase in catabolic gene 
expression (solid red symbols in Fig. 1a) and the concomitant linear 
decrease in the ribosome content (Fig. 1c) upon C-limitation (solid 
green symbols in Fig. 1c) indicates that these phenomena might share 
a common origin. 

This line of qualitative reasoning predicts that, upon NC-limitation 
where the expression levels of catabolic and ribosomal genes both 
decrease linearly as fractions of total protein (open symbols in 
Fig. 1a, c), the expression of other genes (for example, biosynthetic 
genes) should increase linearly with the growth rate. We tested this 
prediction by characterizing the expression of glnA, which encodes 
the major ammonia assimilating protein glutamine synthetase” using 
the titratable GDH system under a fixed carbon source (glycerol). As 
expected, the expression level of the chromosomal PglnA-lacZ fusion 
reporter (open blue diamonds in Fig. 1d, denoted as G) had a negative 
linear correlation with growth rate; similar behaviours were seen for 
the expression of several other genes in amino acid synthesis pathways 
(Supplementary Fig. 13). These responses suggest a general amino 
acid limitation imposed by the titratable GDH system (Supplementary 
Fig. 6), referred to henceforth as ‘A-limitation’ and the resulting 
response as the ‘A-line’ (dashed blue line in Fig. 1d). The latter is 
described mathematically by 


$$G = G_{\max}\left(1 - \lambda/\lambda_A\right) \qquad (2)$$


with an x-intercept $\lambda_A$ and a y-intercept $G_{\max}$ (Supplementary Table 15). 
Interestingly, PglnA-lacZ also had a positive linear correlation with the 
growth rate in C-limited growth (solid blue line in Fig. 1d). Thus E. coli 
perceives carbon limitation as a nitrogen excess, a feature reported 
qualitatively long ago in Klebsiella. 
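
As an illustration of how the intercepts in equation (2) can be read off a data set, the following minimal sketch fits a straight line by ordinary least squares and converts slope and intercept into $G_{\max}$ and $\lambda_A$. The function name and the numbers are hypothetical and are not taken from the Supplementary Tables; the fitting procedure actually used for the reported values may differ.

```python
def fit_line_intercepts(growth_rates, expression):
    """Least-squares fit of G = G_max * (1 - lam / lam_A).

    Returns (G_max, lam_A), i.e. the y-intercept and the x-intercept
    of the fitted straight line.
    """
    n = len(growth_rates)
    mean_x = sum(growth_rates) / n
    mean_y = sum(expression) / n
    sxx = sum((x - mean_x) ** 2 for x in growth_rates)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(growth_rates, expression))
    slope = sxy / sxx                    # equals -G_max / lam_A
    intercept = mean_y - slope * mean_x  # equals G_max
    return intercept, -intercept / slope


# Hypothetical, noise-free data generated with G_max = 2.0 and lam_A = 1.25 (1/h).
lam = [0.2, 0.4, 0.6, 0.8, 1.0]
G = [2.0 * (1.0 - x / 1.25) for x in lam]
g_max, lam_a = fit_line_intercepts(lam, G)
```

The same routine applies to any of the linear relations discussed here, because each one is specified by its two intercepts.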


Quantifying the proteome fractions 

To see whether the response of the catabolic genes can be quantitatively 
accounted for by changes in the biosynthetic genes and the ribosome 
content, we apply and extend a mathematical model of proteome 
partition introduced recently’*” to relate various growth laws to the 
global regulation of gene expression. As illustrated in Fig. 2a and 
explained in detail in Supplementary Note 1, we propose that the 
proteome is composed of several major ‘sectors’, that is, groups of 
genes whose expression levels share similar growth-rate dependences upon 
various modes of growth limitation. These include a catabolic sector 
(C) of mass fraction $\phi_C$ that increases specifically in response to 
C-limitation as exemplified by lacZ (Fig. 1a and Supplementary Fig. 5), 
an anabolic sector (A) of mass fraction $\phi_A$ that increases specifically in 
response to A-limitation as exemplified by glnA (Fig. 1d, Supplementary 
Fig. 13), and a ribosomal sector (R) of mass fraction $\phi_R$ that 
increases specifically in response to translational inhibition (Supplementary 
Fig. 15). To account for additional sectors which have not been 
specifically targeted by the various modes of growth limitation applied 
in this study, we propose an uninduced sector (U) of mass fraction $\phi_U$ 
which generally decreases with decreasing growth rate. Examples of the 
latter might include enzymes for sulphur assimilation and nucleotide 
synthesis (Supplementary Fig. 16). 

As shown by the various reporters, we expect each of the above 
proteome sectors j to contain a growth-rate dependent part $\Delta\phi_j(\lambda)$ and 
a growth-rate independent offset $\phi_{j,0}$, with $\phi_j(\lambda) = \phi_{j,0} + \Delta\phi_j(\lambda)$; see 
Supplementary Fig. 17. Additionally, there may exist a growth-rate 
independent sector (I) of mass fraction $\phi_I$. Together, the growth-rate 
independent components can be lumped into an effective sector (Q) 
of mass fraction $\phi_Q = \phi_I + \sum_j \phi_{j,0}$, as suggested from a previous 
study, providing a cap $\phi_{\max} = 1 - \phi_Q$ on how large the specific 
responses of each sector can increase to, that is, $\sum_j \Delta\phi_j(\lambda) = \phi_{\max}$. 
We estimated $\phi_{\max} \approx 43\%$ using protein overexpression data (Supplementary 
Fig. 18 and Supplementary Note 2). 

The constraint that the $\Delta\phi_j$ add up to a constant ($\phi_{\max}$) imposes 
quantitative relations between the responses of the various sectors 
upon different modes of growth limitation. This constraint is most 
simply expressed (Supplementary Note 3) in terms of a normalized 
fraction, $f_j(\lambda) = \Delta\phi_j(\lambda)/\phi_{\max}$ (see equation S21 in Supplementary 
Information) for each sector j, as 


$$f_C(\lambda) + f_A(\lambda) + f_R(\lambda) + f_U(\lambda) = 1 \qquad (3)$$


Assuming that changes in the expression levels of PlacZ and PglnA 
report changes in the C- and A-sectors, respectively, that is, 
$\Delta L \propto \Delta\phi_C(\lambda)$ and $\Delta G \propto \Delta\phi_A(\lambda)$, the normalized proteome sectors $f_C(\lambda)$ 
and $f_A(\lambda)$ can be estimated from the reporter expression levels using 
our model (Supplementary Fig. 17). The results are indicated on the 
right vertical scales in Fig. 1a, d. Similarly, $f_R(\lambda)$ can be obtained from 
the RNA/protein ratio $r(\lambda)$ (right vertical scale of Fig. 1c and 
Supplementary Fig. 15a). The values of these normalized fractions 
under C-limited growth are plotted together as coloured circles in 
Fig. 2b, and their best linear fits are shown as lines of corresponding 
colours. Since $f_C(\lambda) + f_A(\lambda) + f_R(\lambda) < 1$ (black open circles and line), 
we deduce from equation (3) the existence of an uninduced sector U 
whose abundance, $f_U(\lambda)$, is shown by the purple line; it is 
approximately 30% of the R-sector (green line). 
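
The deduction of the U-sector can be sketched numerically: rearranging equation (3) gives the uninduced fraction as the remainder, and its proportionality to the R-sector can be estimated by a least-squares line through the origin. The values below are hypothetical and were chosen only so that the ratio comes out at 0.3; the through-origin fit is an assumption of this sketch, not necessarily the procedure used in the study.

```python
def uninduced_fraction(f_c, f_a, f_r):
    """Equation (3) rearranged: f_U = 1 - (f_C + f_A + f_R)."""
    return 1.0 - (f_c + f_a + f_r)


def ratio_through_origin(xs, ys):
    """Slope of the least-squares line y = slope * x (no intercept)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


# Hypothetical normalized fractions at three growth rates under C-limitation.
f_c = [0.60, 0.44, 0.28]
f_a = [0.14, 0.17, 0.20]
f_r = [0.20, 0.30, 0.40]

f_u = [uninduced_fraction(c, a, r) for c, a, r in zip(f_c, f_a, f_r)]
u_over_r = ratio_through_origin(f_r, f_u)
```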


Tight coordination of proteome sectors 

The constraint equation (3) suggests that the growth-rate dependences 
of the C- and A-sectors are simply related, given that $f_R(\lambda)$ 
follows the growth law (Fig. 1c) and $f_U \approx 0.3 f_R$. Applying this to 
A-limited growth, we can use the observed expression of PlacZ under 
A-limitation (Fig. 1a) to predict PglnA expression under A-limitation; 
the result is in good agreement with the best fit as shown in Fig. 2c 
(black line), with glycerol as the carbon source. This is repeated for 
A-limitation on glucose, where we expect the smaller C-sector for 
glucose to lead to an expanded A-sector, and hence an increased 
intercept $\lambda_A$ for PglnA. This prediction is verified in Supplementary 
Fig. 19, with the data quantitatively accounted for by the predicted 
A-line (black line). Similarly, the value of the intercept $\lambda_C$ for PlacZ 
under C-limitation with a fixed degree of A-limitation is reduced to the 
expected range (Supplementary Fig. 20). See Supplementary Fig. 21 
and Supplementary Table 16 for the direct comparison of the predicted 
and best-fit values of $\lambda_C$ and $\lambda_A$. As one additional test of this picture, we 
applied sub-lethal amounts of antibiotics to probe the effect of 
translational limitation on the various proteome sectors (Supplementary 
Fig. 22). Whereas the R-sector increased linearly with decreasing 
growth rate, PlacZ and PglnA expression both decreased linearly 
with decreasing growth rate as predicted, again in a manner that 
conforms quantitatively to the constraint equation (3). 
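
The prediction step described above amounts to solving equation (3) for the A-sector once $f_U$ is tied to $f_R$. A minimal sketch, assuming the proportionality $f_U = 0.3 f_R$ stated in the text and using hypothetical input values:

```python
def predict_anabolic_fraction(f_c, f_r, u_over_r=0.3):
    """Solve equation (3) for f_A, assuming f_U = u_over_r * f_R.

    f_C + f_A + f_R + f_U = 1  =>  f_A = 1 - f_C - (1 + u_over_r) * f_R
    """
    return 1.0 - f_c - (1.0 + u_over_r) * f_r


# Hypothetical measured C- and R-sector fractions at one growth rate.
f_a_predicted = predict_anabolic_fraction(f_c=0.28, f_r=0.40)
```

Evaluating this at each measured growth rate gives a predicted A-line that can be compared with the best linear fit to the reporter data.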

These data sets (Fig. 2b, c, Supplementary Figs 19, 20) can also be 
displayed simultaneously on a 3D plot (Fig. 2d); the data are seen to lie 
close to a plane, defined by equation (3) with $f_U \approx 0.3 f_R$. This is an 
example of the Pareto surface, which results generally from trade-offs 
between competing objectives that cannot be simultaneously opti- 
mized. The linear dependences of these fractions on the growth rate 
further indicate that they arise from protein synthesis being the common 
Figure 2 | Proteome fractions and the partition model. a, Illustration of the 
proteome partition model. Upon C-limitation, the C-sector increases and the 
A-, R-, U-sectors decrease, whereas upon A-limitation, the A-sector increases 
and the C-, R-, U-sectors decrease; see Supplementary Note 1. b, Normalized 
responses ($f_C$, $f_A$, $f_R$) indicated by the C-, A-, R-sector reporters upon 
C-limitation (solid red, green, blue circles, respectively, from Fig. 1a, c, d). The 
black circles show the sum $f_C + f_A + f_R$ at each growth rate; they decrease 
linearly with the growth rate (black line). The purple line is the predicted 
U-sector fraction $f_U$ based on equation (3). c, $f_C$, $f_A$, $f_R$ upon A-limitation with 
glycerol as the carbon source (open red, green, blue diamonds, respectively, 
from Fig. 1a, c, d). Taking $f_U$ (purple line) to be the same as that in b ($f_U = 0.3 f_R$), 
equation (3) predicts $f_A$ to follow the black line, and the blue line shows the best fit. 
d, $f_C$, $f_A$, $f_R$ for the four sets of C- and A-limited growth conditions characterized 
(b, c and Supplementary Figs 19, 20) are plotted in a 3D plot. (Unavailable data 
for $f_R$, $f_A$ are generated from the straight line fit.) Two views are shown. The data 
are seen to fall on the predicted surface (equation (3) with $f_U = 0.3 f_R$). 
bottleneck of cell growth. For example, catabolic proteins are needed to 
increase carbon influx while ribosomal and anabolic proteins are needed 
to increase biosynthesis, all of which depend on translation by ribosomes 
(Supplementary Note 4). This logic leads to an important physiological 
rationale for the pleiotropic regulation of catabolic genes by Crp-cAMP: 
given rapidly metabolizable carbons, cells reduce catabolic gene 
expression to increase the expression of ribosomal and anabolic genes 
for rapid cell growth. 


Strategy of metabolic-proteome coordination 

The opposing behaviours shown by the C- and A-sectors upon dif- 
ferent modes of growth limitation (Figs 1 and 2) point strongly to the 
existence of control system(s). Figure 3a describes a simple regulatory 
scheme wherein a single signal can sense and eliminate imbalance 
between metabolic activities on the carbon and nitrogen sides. In this 
coarse-grained scheme, the external carbon source is converted to the 
pool of carbon precursors (K), for example, α-ketoacids, which 
subsequently combine with ammonium to form amino acids (a) via 
transamination reactions (Supplementary Fig. 6). If the carbon influx ($J_C$) 
exceeds the nitrogen influx ($J_N$), then precursors (K) will accumulate. 
A signal from K to inhibit Crp-cAMP activity (dotted red line in 
Fig. 3a) can account for the observed responses by the catabolic genes 
(Fig. 1a and Supplementary Fig. 5), both for increases in $J_C$ (relief of 
C-limitation) and decreases in $J_N$ (A-limitation). An opposite signal 
from K could underlie changes in the A-sector (dashed blue arrow). 
Detailed mathematical analysis (Supplementary Note 5) shows how 
Figure 3 | Transient repression by metabolic precursors. a, A coarse-grained 
view of metabolism, focusing on the biosynthesis of amino acids from the 
carbon and nitrogen influxes ($J_C$ and $J_N$, respectively). Carbon precursors such 
as α-ketoacids (K) sense the difference between $J_C$ and $J_N$. An integral feedback 
scheme using the regulation of catabolic and anabolic enzymes ($\phi_C$ and $\phi_A$, 
respectively) by K can coordinate these metabolic sectors in a parameter-free 
manner; see Supplementary Note 5 for details. b, PlacZ-lacZ expression was 
characterized for wild-type NCM3722 cells grown exponentially in various 
carbon sources and with 1 mM IPTG to deactivate LacI. At time zero, 20 mM 
OAA was added; a transient repression period of ~30 min is shaded in grey. 
PlacUV5-lacZ expression in strain NQ1053 was characterized in the same way 
(black squares; right y-axis). c, LacZ expression levels before and during the 
repression period (Fig. 3b and Supplementary Figs 23-27) are summarized by 
the open and grey bars. Striped bars show the results of PlacUV5-lacZ. d, cAMP 
concentrations in the medium were monitored for wild-type cells grown in 
glycerol and the two PTS-deletion strains, NQ721 (Δpts) and NQ506 (Δ5EI 
Δpts), grown in lactose. 20 mM OAA was added at time zero. e, Relative cAMP 
excretion rates were quantified before and during the repression period; see 
Supplementary Methods. In c and e, data were expressed as mean ± s.e.m. 
(n = 3). 


304 | NATURE | VOL 500 | 15 AUGUST 2013 




such a feedback loop (known as “integral feedback” since K is the 
time-integral of the flux difference between $J_C$ and $J_N$ being regulated), 
together with global constraints provided by proteome partition, can 
generate the C- and A-lines generically, independent of the detailed 
forms of the regulatory functions. 
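
The claim that integral feedback balances the two fluxes regardless of the detailed regulatory function can be illustrated with a toy simulation. This is only a sketch under stated assumptions and is not the model of Supplementary Note 5: the two sectors are assumed to share a fixed total of 1, each flux is taken as proportional to its sector, and the rate constants and both regulatory functions are hypothetical.

```python
import math


def simulate(regulation, k_c=2.0, k_a=1.0, dt=0.01, steps=10000):
    """Euler-integrate dK/dt = J_C - J_N; return the final catabolic fraction."""
    K = 0.0
    for _ in range(steps):
        phi_c = regulation(K)      # catabolic fraction, repressed by K
        phi_a = 1.0 - phi_c        # anabolic fraction (toy proteome constraint)
        j_c = k_c * phi_c          # carbon influx
        j_n = k_a * phi_a          # nitrogen influx
        K = max(K + dt * (j_c - j_n), 0.0)  # K integrates the flux difference
    return regulation(K)


def hyperbolic(K):
    """One hypothetical repression function."""
    return 1.0 / (1.0 + K)


def exponential(K):
    """A second, differently shaped repression function."""
    return math.exp(-K)


phi_c_hyperbolic = simulate(hyperbolic)
phi_c_exponential = simulate(exponential)
```

Both runs settle where $J_C = J_N$, that is, at a catabolic fraction of $k_a/(k_c + k_a) = 1/3$ for these constants, even though the precursor level K needed to reach that point differs between the two functions.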

The proposed integral feedback strategy can be tested by adding a 
precursor to an exponentially growing culture. If the uptake of precursor 
significantly exceeds its metabolic turnover, then the transient build-up 
of the precursor should reduce the internal cAMP level and the 
expression of catabolic genes, until a later time when metabolic fluxes are 
re-equilibrated. We first tested oxaloacetate (OAA), a major metabolic 
precursor (an α-ketoacid) which is co-used with other carbons by 
wild-type cells to give faster steady-state growth (Supplementary Table 17). 
The addition of OAA to cells growing exponentially in glycerol, glucose 
or lactose strongly inhibited PlacZ expression (with LacI deactivated) 
for up to 30 min (Fig. 3b, c and Supplementary Fig. 23). This transient 
effect was Crp-cAMP-dependent because no effect was observed for 
PlacUV5 (Fig. 3b, c) and another Crp-independent promoter (Sup- 
plementary Fig. 23f). The inhibitory effect is most probably exerted on 
cAMP synthesis, because the excretion of cAMP, which reflects the 
internal cAMP level’’, was also inhibited over the same period upon 
OAA addition (Fig. 3d, e). Thus, OAA is a candidate precursor imple- 
menting the feedback inhibition depicted in the dotted red line of 
Fig. 3a. 

Strong to moderate repressive effects were also observed following 
the addition of α-ketoglutarate (AKG) (Fig. 3c and Supplementary 
Figs 24, 25) and pyruvate (PYR) (Fig. 3c and Supplementary Fig. 26), 
both of which are α-ketoacids. In contrast, the addition of succinate 
(SUCC), which is a dicarboxylic acid like OAA and AKG, but is not an 
α-ketoacid, led to only minor repression (Fig. 3c and Supplementary 
Fig. 27). Although our study does not rule out other possible 
causes of transient repression, the inhibitory effects produced by 
the α-ketoacids are much stronger than those by glucose and the 
non-hydrolysable compound methylthio-β-galactoside (TMG) investigated 
previously; see Supplementary Fig. 28. Finally, α-ketoacid addition 
also caused a transient increase in the expression of the anabolic gene 
glnA (Supplementary Fig. 29a, b). This regulatory effect is independent 
of cAMP (Supplementary Fig. 29c, d, e), and is captured by the dashed 
blue arrow in Fig. 3a. 


Mechanism of metabolic-proteome coordination 


The cAMP signalling pathway linking metabolism to catabolic gene 
expression is commonly thought to comprise the PTS proteins 
(Supplementary Fig. 1). Recently, AKG was reported to interact 
directly with enzyme I (EI) of the PTS proteins”, making PTS proteins 
likely candidates for implementing the proposed feedback by precur- 
sors (dotted red line in Fig. 3a). However, strong transient repression 
was still observed upon the addition of α-ketoacids in strains with 
deletion of various PTS proteins, including one with the 3 major PTS 
proteins deleted (Δpts) and another with additionally all 5 EI proteins 
deleted (Δ5EI Δpts); see Fig. 4a, b and Supplementary Figs 30, 31. This 
transient effect was still accompanied by a similar fold reduction in 
cAMP excretion in vivo (Fig. 3d, e), and the PTS mutants in steady 
state still showed C-lines for C-limited growth (Fig. 4c). The C-lines of 
the PTS mutants (dashed lines) showed x-intercepts ($\lambda_C$) similar to that of the 
wild type (red line), indicating that although PTS proteins do affect 
the degree of Crp-cAMP-mediated regulation of catabolic gene 
expression as is long known, they are not necessary for the existence 
of this response (that is, the occurrence of the C-line). 

Next we investigated directly the inhibitory effect of various can- 
didate metabolites on cAMP synthesis, by in vitro assays with per- 
meabilized cells (ref. 48 and Supplementary Methods). The rate of 
cAMP synthesis by adenylate cyclase was significantly reduced by 
several α-ketoacids but not by SUCC, in either PTS+ or PTS− 
background (Fig. 4d and Supplementary Fig. 32). Thus, the simplest 
regulatory route consistent with all data is to have precursors such as 
Figure 4 | Mechanism of cAMP-dependent signalling. a, OAA transiently 
repressed PlacZ-lacZ expression in PTS-deleted cells, NQ721 (Δpts) and 
NQ506 (Δ5EI Δpts), grown exponentially in lactose. 20 mM OAA was supplied 
at time zero. The response of WT cells to OAA in Fig. 3b (red triangles) was 
plotted for comparison. b, The magnitudes of transient repression by OAA and 
AKG were quantified as in Fig. 3c, based on the data in Fig. 4a and 
Supplementary Figs 30, 31. c, Steady state PlacZ-lacZ expression in PTS 
mutants under various modes of C-limitation; see Supplementary Table 18 for 
data and conditions. Dashed lines show the best linear fit (Supplementary Table 
19). The C-line of Fig. 1a is shown in red for reference. d, In vitro adenylate 
cyclase activities in strains NQ385 (pts), NQ976 (Δpts) and NQ977 (Δ5EI 
Δpts) were assayed with or without 10 mM of various candidate inhibitors; see 
Supplementary Methods for details. The cAMP phosphodiesterase, which is not 
primary to signalling in cAMP-dependent CCR, is also deleted in these strains; 
see Supplementary Fig. 33. In b and d, data were expressed as 
mean ± s.e.m. (n = 3). 


OAA inhibiting cAMP synthesis by adenylate cyclase (Supplementary 
Fig. 34), either directly or through the help of a mediator protein. 
Simple inhibition kinetics was found for OAA, with half-inhibition 
concentrations in the sub-millimolar range (Supplementary Fig. 35). 
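
To make the notion of a half-inhibition concentration concrete, the sketch below assumes the simplest hyperbolic form, in which relative activity is $1/(1 + [I]/K_{1/2})$. Both the functional form and the value $K_{1/2} = 0.5$ mM are illustrative assumptions; the measured curves are in Supplementary Fig. 35, and the text states only that the half-inhibition concentrations are sub-millimolar.

```python
def relative_activity(inhibitor_mM, half_inhibition_mM):
    """Fraction of uninhibited activity under simple hyperbolic inhibition."""
    return 1.0 / (1.0 + inhibitor_mM / half_inhibition_mM)


K_HALF_MM = 0.5  # hypothetical half-inhibition concentration (mM)

at_half = relative_activity(K_HALF_MM, K_HALF_MM)  # activity halved by definition
at_10mM = relative_activity(10.0, K_HALF_MM)       # the 10 mM assay concentration
```

Under this assumed form, an inhibitor with a sub-millimolar half-inhibition concentration would leave only a few per cent of the activity at 10 mM.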


Discussion 


cAMP-mediated CCR is one of the most intensely studied regulatory 
systems in biology. Results described here indicate that the physio- 
logical function of cAMP signalling is not centred on carbon meta- 
bolism per se as commonly thought; rather this pathway coordinates 
the allocation of proteomic resources (Fig. 2a) with different meta- 
bolic demands in different nutrient environments. This global regu- 
latory process is directed by an integral feedback system (Fig. 3a), 
driven by cAMP signals modulated by metabolic precursors such as 
several α-ketoacids. These precursors can be identified as the 
‘catabolites’, the accumulation of which was proposed to induce CCR over 
50 years ago. 

The complexity of molecular interactions underlying this signalling 
system made it difficult to elucidate its physiological function in a 
bottom-up approach despite the wealth of molecular data known for 
this classic system. However, the top-down physiological approach 
described here was able to reveal simple quantitative relations between 
gene expression and growth rates; these relations provided key insight 
on the physiological function of cAMP signalling, leading to the elu- 
cidation of molecular strategy and implementation. The effectiveness 
of this approach provides a conceptual framework to pursue the use of 
quantitative physiological approaches to elucidate other complex sig- 
nalling networks. 


METHODS SUMMARY 


Materials and methods, including strain construction, cell culture, LacZ activity assay, 
cAMP excretion rate assay, RNA/protein ratio assay, in vitro adenylate cyclase 
activity assay and theoretical analysis, are provided in detail in Supplementary 
Information. 
The initiation of mammalian protein 
synthesis and mRNA scanning mechanism 


Ivan B. Lomakin1 & Thomas A. Steitz1,2 


During translation initiation in eukaryotes, the small ribosomal subunit binds messenger RNA at the 5’ end and scans in the 5’ to 
3’ direction to locate the initiation codon, form the 80S initiation complex and start protein synthesis. This simple, yet intricate, 
process is guided by multiple initiation factors. Here we determine the structures of three complexes of the small ribosomal 
subunit that represent distinct steps in mammalian translation initiation. These structures reveal the locations of eIF1, eIF1A, 
mRNA and initiator transfer RNA bound to the small ribosomal subunit and provide insights into the details of translation 
initiation specific to eukaryotes. Conformational changes associated with the captured functional states reveal the dynamics of 
the interactions in the P site of the ribosome. These results have functional implications for the mechanism of mRNA scanning. 


In bacteria, translation initiation is controlled by initiation factors IF1, 
IF2 and IF3 (ref. 1). In eukaryotes, translation initiation is more com- 
plex: multiple initiation factors facilitate the recruitment and scanning 
of mRNA, selection of initiator tRNA, and the joining of the large and 
small ribosomal subunits**. Eukaryotic initiator tRNA binds speci- 
fically to eIF2-GTP, forming a stable ternary complex. The ternary 
complex forms the 43S pre-initiation complex (PIC) by binding to 
the small ribosomal subunit (40S). Formation of the 43S PIC is pro- 
moted by the eukaryotic initiation factors (eIF) eIF1, eIF1A, eIF3 and 
eIF5 (ref. 5). 43S PIC then binds to the capped 5’ end of mRNA through 
eIF4F and eIF3, and scans the mRNA up to the first AUG codon. 

The scanning hypothesis (first AUG rule) was proposed over 30 years 
ago® and subsequently refined to reflect that sequences surrounding 
AUG, particularly at the −3 and +4 positions with respect to the A, 
are critical for initiation in mammals’*. Despite widespread acceptance 
of this model, its molecular mechanism remains unknown7”. Scanning 
stops upon AUG recognition, triggering eIF1 dissociation. The release 
of phosphate from the ternary complex makes this reaction irreversible 
and promotes dissociation of eIF2-GDP. The 48S PIC that is now 
formed contains the initiator tRNA base paired with AUG in the P site 
of the 40S subunit. The subsequent attachment of the large (60S) ribo- 
somal subunit is facilitated by the GTP binding factor eIF5B. eIF5B and 
eIF1A dissociate from the 40S subunit during or after 80S ribosome 
assembly, leaving the ribosome primed for elongation’. 
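
The first-AUG rule and the position numbering used above can be sketched in a few lines. The example scans a sequence 5' to 3', reports the first AUG, and returns the nucleotides at −3 and +4 relative to the A (which is +1; there is no position 0). The function names and the test sequence are hypothetical. The "strong context" check (a purine at −3 and a G at +4) reflects the commonly cited consensus and is an assumption of this sketch, not a result of the present study.

```python
def scan_for_start(mrna):
    """Return (index_of_A, nt_at_minus3, nt_at_plus4) for the first AUG.

    Returns None if the sequence contains no AUG. Context nucleotides
    that fall outside the sequence are reported as None.
    """
    seq = mrna.upper().replace("T", "U")
    idx = seq.find("AUG")            # 5'-to-3' scan stops at the first AUG
    if idx == -1:
        return None
    minus3 = seq[idx - 3] if idx >= 3 else None
    plus4 = seq[idx + 3] if idx + 3 < len(seq) else None
    return idx, minus3, plus4


def is_strong_context(minus3, plus4):
    """Assumed consensus: purine (A or G) at -3 and G at +4."""
    return minus3 in ("A", "G") and plus4 == "G"


hit = scan_for_start("GCCGCCACCAUGGCG")  # hypothetical test sequence
strong = is_strong_context(hit[1], hit[2])
```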

We have determined the crystal structures of the rabbit 40S subunit 
in complex with: (1) eIF1 (PIC1); (2) eIF1 and eIF1A (PIC2); and (3) 
mRNA, tRNA and eIF1A (48S PIC). These structures establish the 
positions of eIF1, eIF1A, mRNA and tRNA and reveal the architec- 
tural principles governing the assembly of the pre-initiation complex 
on mRNA. Comparison of these structures shows that a conforma- 
tional rearrangement in the P site and a rotation of the head of the 40S 
subunit accompany complex assembly. This provides the basis for 
understanding the functions of eIF1 and eIF1A during scanning. 


Crystallization and structure determination 

Rabbit 40S ribosomal subunit complexes were crystallized in the 
P3,21 space group and contain one 40S subunit per asymmetric unit 
(Fig. 1 and Supplementary Fig. 1). Complete data sets were collected to 
7.9-9 Å resolution (Supplementary Table 1). The 48S PIC was formed by 
incubating 40S ribosomal subunits with mRNA, non-aminoacylated, 


in vitro transcribed initiator tRNA (tRNAi), and human eIF1A and 
eIF1. Although the presence of eIF1 in the reaction mixture is essential 
for crystallization, this factor is not seen in the 48S PIC structure. The 
PIC1 complex was formed by incubating 40S ribosomal subunits with 
eIF1, and the PIC2 complex was formed by soaking eIF1A in crystals of 
PIC1 (Methods). 

The presence of additional electron densities in the maps allows us 
to identify the locations of eIF1, eIF1A, tRNA and mRNA unambigu- 
ously (Fig. 1 and Supplementary Fig. 2). We provide Escherichia coli 
nomenclature for ribosomal proteins and rRNA in parentheses. We 
focus predominantly on the details related to translation initiation, 
which include the interactions of initiation factors, tRNA and mRNA 
with the 40S ribosomal subunit and their implications for the mech- 
anistic understanding of the scanning process. 


Initiator tRNA in the P site 


The P site of the 40S subunit contains a binding pocket, occupied by 
the anticodon stem-loop (ASL) of tRNAi. This pocket is formed by the 
18S rRNA helices h28, h44 of the body, h24 of the platform, h28, h29, 
h30, h31 of the head domains and by three ribosomal proteins (rpS) 
located in the head of the 40S subunit: rpS15(19), rpS16(9) and rpS18(13) 
(Fig. 2 and Supplementary Fig. 7). The regions of tRNAi that interact 
with rRNA are similar to those seen for the P site tRNA bound to 
the 70S ribosome. However, the rRNA is arranged differently in 
the tRNAi binding pockets of PIC1, PIC2 and 48S PIC, reflecting 
the differences between the scanning-competent (PIC1, PIC2) and 
scanning-incompetent (48S PIC) conformations. Movement of the 
tRNAi towards the E site is blocked in the 48S PIC by helices h24 
and h29, which contact the ASL from the platform and head side of 
the 40S subunit, respectively (Supplementary Fig. 3). The position of 
h29 is different in the PIC1 and PIC2 structures compared to that in 
48S PIC, and allows the P-site-bound tRNA to move towards the E site 
(Fig. 2b). The corresponding region of h29 has shifted ~4 Å towards 
the E site in PIC1 and ~2 Å in PIC2. The binding of tRNAi on the 
opposite side of the pocket in the 48S PIC might be stabilized by the 
carboxy terminus of rpS15(19) and by the amino-terminal tail (NTT) of 
eIF1A, as shown by the presence of additional electron density near the 
ASL of the tRNAi in the corresponding map (Supplementary Fig. 4). 
The suggested orientation of the C terminus of rpS15(19) in the 48S PIC 
can explain the crosslink reported between rabbit rpS15(19) and the +4 


1 Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8114, USA. 2Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8114, USA. 




©2013 Macmillan Publishers Limited. All rights reserved 




Figure 1 | The crystal structure of rabbit pre-initiation complexes. a, 48S 
PIC. b, PIC2. c, PIC1. B, body; H, head; h44, helix 44; P, platform. eIF1A 
(violet), eIF1 (red), tRNAi (green) and mRNA (yellow) are shown. The basic 
loop of eIF1 is marked by an arrow; the N terminus of eIF1A is marked by N. 


position on mRNA in the 48S complex. Positioning of the NTT of 
eIF1A near the ASL of tRNAi in 48S PIC is consistent with biochemical 
data demonstrating that NTT contributes to eIF1A binding to the 40S 
subunit and may interact with the ASL. The NTT of eIF1A remains 
unstructured in PIC2. We propose that the architecture of the P site in 
PIC1 and PIC2 represents the scanning-competent conformation of the 
40S subunit, with tRNAi in the Pout mode. In contrast, the P site in 48S 
PIC represents a scanning-incompetent conformation of the 40S 
subunit, with the ASL of tRNAi in Pin mode: locked in position by base 
pairing with AUG and interactions with the binding pocket, possibly 
with rpS15(19) and the NTT of eIF1A (refs 2, 17). This position of 
tRNAi is similar to the P/I state in the bacterial 30S initiation 
complex (Supplementary Fig. 6). 

Rearrangements of the P site in the PICs are accompanied by a 3° 
clockwise rotation of the head of the 40S subunit after the binding of 
eIF1A to PIC1, and an additional 3° upon formation of the 48S PIC. 
Rotations of the head shift part of the tRNAi binding pocket towards 
the A site by ~2 Å with each rotation. At the same time, h28, which 
comprises the neck that connects the head and body of the 40S sub- 
unit, does not change its position (Fig. 2b and Supplementary Fig. 5). 
Figure 2 | Interactions of tRNAi and eIF1A with the 40S subunit. a, eIF1A 
(violet) binds to h44, rpS30e (wheat) and rpS23 (green). Possible positions of 
the NTT of eIF1A and the C terminus of rpS15 (brown) are shown as dashed 
lines. Conserved regions of 18S rRNA, which form the tRNAi binding pocket, 
are in red. rpS16 (blue), rpS18 (pink) and rpS31 (orange) are shown. 
b, Superposition of 48S PIC, PIC1 and PIC2. Only regions of rRNA that are 
important for the interaction with the tRNAi are shown. Clockwise rotation of 
the head domain is indicated. 
d-f, Initial unbiased electron density maps $F_o - F_c$ contoured at σ = 2.0 (green 
mesh) for 48S PIC (d), PIC2 (e) and PIC1 (f). Docked structures are shown in 
black. 


The role of h28 as a pivot point for head rotation suggests that muta- 
tions in the neck affect rotation of the head and repress the conforma- 
tional rearrangements of the P site that are required during scanning. 
This probably explains the leaky scanning phenotype seen in muta- 
tional studies of h28 in the yeast 18S rRNA”. 


The mRNA channel and the mRNA path 

The electron density map of the 48S PIC reveals the position of the 
mRNA on the 40S subunit from nucleotides −6 to +7 (Fig. 3a). Its 
conformation does not have the sharp kink between the A and P site 
codons that is seen in the bacterial 70S ribosomal complexes. This kink 
allows the simultaneous pairing of the A and P site tRNA anticodons 
with the mRNA”! (Fig. 3a), which defines the reading frame and pre- 
vents slippage of the mRNA”. The conformation of the mRNA in the 
scanning complex should promote the slippage of mRNA. eIF1A 
assists the slippage of the 40S subunit along the mRNA by occupying 
the A site, which prevents possible interactions between tRNAs and 
mRNA that might otherwise fix the mRNA on the ribosome (Fig. 3a 
and Supplementary Fig. 7). 

The mRNA path in the 48S PIC is similar to that in 70S ribosomal 
complexes*”’. Eukaryote-specific features are localized in the mRNA 
channel near nucleotides −3 and +4, which are the most sensitive 
positions in the consensus sequence around the initiation codon, and 


Figure 3 | Path of the mRNA in the 48S PIC. a, Initial unbiased electron 
density map $F_o - F_c$ contoured at σ = 2.0 (green mesh). eIF1A (violet), mRNA 
in eukaryotic 48S PIC (yellow) and mRNA from bacterial 70S translating 
ribosome (pink) are shown. Nucleotides +4 and −3 of the mRNA are in dark 
green. b, rpS26e is ~5 Å away from nucleotide −3 of the mRNA. The C 
terminus of rpS28e is ~7 Å from the −5, −6 region of the mRNA. 
also upstream of the E site, because eukaryotes lack the Shine-Dalgarno 
sequence (SD). Two ribosomal proteins, rpS26e and rpS28e, which 
do not have homologues in eubacteria, are present in these regions. 
rpS26e has an α-helix (C74-V83) that is located ~5 Å away from 
positions −4 and −3 on the mRNA as well as from h28 (Fig. 3b). 
This helix may change the orientation of the −3 nucleotide during 
scanning, thus affecting its interaction with the α-subunit of eIF2, 
implicated in the discrimination of the AUG context at position −3 
(ref. 14). Although rpS26e is bound to the platform of the 40S subunit, 
its N terminus interacts with the neck of the 40S subunit, possibly 
facilitating communication between position −3 on the mRNA, the 
head domain and the platform. 

mRNA position −6 is located in the vicinity of rpS28e. This region of 
mRNA is of particular interest because it is involved in the SD inter- 
action in prokaryotes and modulates translation initiation on non-AUG 
initiation codons in eukaryotes. The C terminus of rpS28e contains 
three conserved arginine residues, located ~7 Å from nucleotide −6, 
which may interact with it (Fig. 3b). We suggest that these eukaryote- 
specific interactions may stabilize the position of mRNA during ini- 
tiation. The initiation codon and nucleotide +4 are located in a gorge 
formed by the top of h44 (Fig. 3a and Supplementary Fig. 7). We propose 
that nucleotides 1697(1401) and 1820(1494) of h44 of 18S rRNA, 
together with the NTT of eIF1A, modulate the orientation of nucleotide 
+4 of the mRNA, which may pause mRNA sliding during scanning 
(Fig. 3a). This pause would allow the AUG codon or ASL of tRNAi to 
be accommodated for codon-anticodon base pairing. The presence of 
electron density attributed to the NTT of eIF1A in the 48S PIC (Sup- 
plementary Fig. 4) indicates that upon base pairing with the initiation 
codon, the NTT might stabilize the Pin conformation of the tRNAi’” by 
interacting with the mRNA’s +4 nucleotide and/or with the ASL of 
tRNAi (Fig. 2 and Supplementary Fig. 7). 


Initiation factor eIF1A and the latch 


The universally conserved eIF1A stimulates binding of the ternary 
complex to the 40S subunit and is involved in the scanning and 
selection of the initiation codon’. Its bacterial orthologue IF1 is bound 
to the A site of the 30S ribosomal subunit, where it sterically prevents 
the binding of tRNA”. The location of the binding site of eIF1A on the 
40S ribosomal subunit was previously mapped to the A site’®. An 
NMR structure of eIF1A revealed that its main domain has an 
oligonucleotide/oligosaccharide binding fold that is similar to IF1, an 
additional C-terminal subdomain with two α-helices, and long unstructured 
N- and C-terminal tails (CTT)”*. Whereas the CTT is required for 
stringent AUG selection by promoting the scanning-competent 
conformation of the initiation complex, mutations in the NTT suppress the 
elevated level of UUG initiation’*””. 

Electron density maps of 48S PIC and PIC2 show clear density for 
eIF1A (Fig. 1)’°. Whereas the CTT of eIF1A remains disordered, the 
NTT is partially visible in the 48S PIC. The additional density in the 
difference map (Fo − Fc) of the 48S PIC can be attributed to the NTT 
of eIF1A. In this case, it is located 7–8 Å from nucleotide +4 of the 
mRNA and the ASL of the tRNAi (Fig. 2 and Supplementary Fig. 4). 
eIF1A binds to the 40S subunit at the top of h44, which forms a part of 
the intersubunit bridge B2a (ref. 28), and to h18 of the 18S rRNA 
through its oligonucleotide/oligosaccharide binding domain, similar 
to that of IF1 (ref. 25). eIF1A interacts with the N terminus of rpS30e 
and with rpS23(12) (Fig. 2a and Supplementary Fig. 7). This position 
of eIF1A ensures that the A site on the 40S subunit remains inaccess- 
ible to tRNAs during initiation. 

eIF1A forms a bridge between the head and the body of the 40S 
subunit through its helical subdomain (PIC2 and 48S PIC) and NTT 
(48S PIC) (Figs 2a and 4). These eukaryote-specific interactions are 
consistent with hydroxyl radical cleavage experiments'®. Bridging 
together the head and the body of the 40S subunit ensures the closure 
of the mRNA channel around the A site, which promotes scanning of 
the mRNA. 
The ‘latch’ of the mRNA entry channel remains closed in the structures 
of all three PICs presented (Fig. 4a). It is formed by interactions 
between h18 of the body with h34 and ribosomal protein rpS3(3) of 
the head, which clamp around the incoming mRNA”. It was pro- 
posed that the closed conformation of the latch is a feature of ribo- 
somes in the active conformation, which “provides a geometry that 
guarantees processivity and ensures maximum fidelity”*’. We pro- 
pose that closure of the latch is necessary for the scanning function of 
the 40S ribosomal subunit. Closing of the latch and eIF1A binding 
alter the environment of the mRNA binding channel and ensure that 
the mRNA remains unstructured, correctly oriented for scanning and 
free to move towards the exit. 

The closed conformation of the latch in our structure of PIC2 is 
different from the open conformation of the latch in the cryo-electron 
microscopy reconstruction of yeast 40S subunit complex with eIF1 
and eIF1A (ref. 29). It is possible that eIF3 may be involved in the latch 
opening in mammals, because unlike in yeast, eIF3 is required for 
ternary complex to bind to the 40S subunit. The latch closure might be 
influenced by crystallization, although the crystal packing does not 
affect rotation of the head domain of the 40S subunit. Furthermore, 
the latch could sample different conformational states during ini- 
tiation. The open conformation of the latch allows direct loading of 
the mRNA in the mRNA binding channel, instead of threading it 
through a tunnel”’. We propose that the cryo-electron microscopy 
structure of the yeast 40S subunit with bound eIF1 and eIF1A repre- 
sents the mRNA binding state of the ribosome, whereas the rabbit 
PIC2 represents its scanning mode. 


eIF1 is poised to destabilize tRNAi binding 


eIF1 is a functional homologue of the bacterial IF3 (refs 31, 32). 
During scanning, eIF1 allows proofreading and sensing a non- 
optimal context of AUG codons, while promoting the dissociation 
of aberrantly assembled complexes*’*. eIF1 binds to the top of h44 
and to helices h24 and h45 of the 18S rRNA in the vicinity of the P site 
(Figs 1 and 4b), consistent with previously published studies*!”>”*. 
The region of interaction between eIF1 and 18S rRNA consists of 
the intersubunit bridge B2a and part of the bridge B2b (ref. 28). 
Superposition of the 48S PIC onto PIC1 and PIC2 reveals moderate 
steric clashes between eIF1 and tRNAi, as was predicted earlier by 
comparison of structures of 40S subunit complex with eIF1 from 
Tetrahymena thermophila and bacterial 70S ribosomal complexes”. 
The basic loop of eIF1 (R38–K42) in both PIC1 and PIC2 is positioned 
between the ASL of tRNAi and nucleotides 1821–1823(1495–1497) of 
h44, which stabilizes the interaction between AUG and the anticodon 
in the P site of the 48S PIC. This results in steric clashes between the 
basic loop of eIF1 and the ASL of tRNAi and between the region 
around residues P77–G80 of eIF1 and the D stem of tRNAi (Fig. 4b). 


Figure 4 | The latch and initiation factor eIF1. a, Closed conformation of the 
latch. Interactions between body and head domains are connected by dashed 
lines. b, The P site of the 40S subunit. eIF1 from PIC1 (pink) or PIC2 (red) 
would sterically clash (red X) with the ASL and the D stem of tRNAi (green) 
that is positioned in the 48S PIC. 
It has been proposed that eIF1 modulates recognition of the start 
codon by restricting tRNAi binding to AUG only in an optimal 
context****. Fluorescence resonance energy transfer (FRET) experiments 
with eIF1 and eIF1A demonstrated that AUG recognition triggers the 
dissociation of eIF1 from 48S PIC, which allows the release of the 
phosphate (Pi) from the eIF2–GDP–Pi complex'®"’. Our data provide 
an explanation by revealing that eIF1 spatially interferes with the 
adjustment of the ASL of the initiator tRNA in the P site during 
scanning. Establishing the interaction between the AUG codon and 
the anticodon results in locking of the P site tRNA, which can prevent 
the basic loop of eIF1 from re-entering the P site. Kinetic experiments 
show that the dissociation constant of eIF1 from the 40S subunit is 
around 1.4 nM in the presence of the ternary complex, whereas the 
high affinity of the ternary complex to the 40S subunit results from 
base pairing with mRNA”. This affinity is low when no mRNA is 
bound to the 40S subunit or when only one nucleotide in the P site is 
base paired. It is increased with the ability of the initiator tRNA to base 
pair with two nucleotides and becomes very strong upon base pairing 
with the AUG (dissociation constant (Kd) < 1 nM)’’”’. At this point, 
upon the correct codon–anticodon base pairing, eIF1 cannot compete 
with the ASL of tRNAi for the P site and must dissociate. 

The basic loop of eIF1 is located within 5 Å of h44 and 10 Å of the +4 
position on mRNA (Fig. 4b). That may allow eIF1 to ‘inspect’ the 
important region downstream of AUG. The same region of h44 under- 
goes small conformational changes upon formation of 48S PIC, which 
can weaken its interaction with eIF1 and contribute to the dissociation 
of eIF1 from the 40S subunit. Consequently, our structures of rabbit 
PICs provide mechanistic insights into the current model of eukaryotic 
translation initiation. 


Scanning model 


The observed conformations of the pre-initiation complexes described 
in this study together with previous biochemical data’ allow us to 
propose a model for scanning. Briefly, the mRNA slides through the 
tunnel, formed by the body and the head of the 40S subunit (the latch), 
eIF1A and eIF1. The tunnel ensures scanning processivity by keeping 
the mRNA unstructured and properly oriented for the ‘inspection’ of 
the nucleotide sequence in the P site by tRNAi. The tRNAi ‘inspects’ or 
scans the mRNA by attempting to establish Watson–Crick base pairing 
between its anticodon and a nucleotide triplet of mRNA moving 
through the P site. The tRNAi is in the Pout state because the orientation 
of the head domain of the 40S subunit allows a minor movement of the 
tRNAi to avoid a steric clash with eIF1. The affinity of the basic loop of 
eIF1 to the P site is sufficient to displace the ASL of tRNAi and prevent 
the locking of the tRNAi in the P site (Fig. 5). 

When the AUG codon is in the P site it becomes base paired with all 
three nucleotides of the anticodon of the tRNAi, thereby stabilizing 
the conformation of the tRNAi while allowing it to displace the basic 
loop of eIF1. Subsequent rotation of the head domain locks tRNAi in 
the Pin state and completes the scanning (Fig. 5). The transition from 
Pout to Pin mode of tRNAi is probably promoted by the C terminus of 
rpS15(19) and the NTT of eIF1A (ref. 17), which together might 
stabilize the base-paired conformation of tRNAi in the P site. In 
addition, a similar effect may be caused by the interactions of the 
nucleotide +4 on the mRNA with h44, eIF1A and eIF1, and the 
nucleotide −3 with h23, rpS5(7) and rpS26e. The dissociation of 
eIF1 and eIF2 from the 40S subunit results in a stable 48S PIC similar 
to the one observed in this study. eIF5B then binds to this complex 
and orients the acceptor stem of tRNAi towards the P site on the 60S 
subunit to facilitate subunit joining in a manner that is similar to its 
bacterial analogue IF2 (Supplementary Fig. 6). 

Figure 5 | Scanning model. The basic loop of eIF1 competes for the P site of 
the 40S subunit with the ASL of tRNAi, which results in displacement of each 
other during mRNA scanning. 

Conformational changes in the P site may affect the interaction of 
eIF1 with the top of h44 to which it binds. This probably provides 
synchronization between the rotations of the head domain of the 40S 
subunit, which rearrange the P site, and the presence of eIF1 on the 
ribosome. Interaction of eIF1A with h44 may cause the same effect 
and explain the cooperative binding of eIF1A and eIF1 to the 40S 
subunit””*. Future high-resolution structures of the 40S ribosomal 
subunit complexes with the initiation factors, mRNA and tRNA, 
coupled with biochemical studies, are needed to refine and complete 
the model presented here. 


METHODS SUMMARY 


40S ribosomal subunits were purified from rabbit reticulocyte lysate. Phases for 
calculating the electron density maps were determined by molecular replacement 
using as a search model the structure of the 40S subunit from the human 80S 
ribosome (Supplementary Tables 1-3 and Methods”). Modelling of additional 
rRNA and protein regions was impossible due to restrictions imposed by the 
resolution limits. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Cloning and protein expression. The DNAs encoding full-length human eIF1 
and eIF1A with additional N-terminal affinity tag and TEV protease cleavage site 
(|) sequences (MGHHHHHHDYDIQTTENLYFQ|G) were synthesized by PCR 
and cloned into pET28b expression vector (Novagen). Recombinant initiation 
factors were expressed in E. coli Rosetta2(DE3)pLysS cells (Novagen). 

Protein purification. Recombinant eIF1 and eIF1A were purified using a HisTrap 
HP column (GE Healthcare), essentially as described*’. The proteins were then 
treated with AcTEV protease (Invitrogen) according to the manufacturer’s instru- 
ctions. The affinity tag was removed by chromatography on a HisTrap HP column 
(GE Healthcare). Tag-free eIF1 and eIF1A were further purified on Mono S and 
Mono Q columns (GE Healthcare), respectively. Finally, both proteins were puri- 
fied on a Superdex 75 column (GE Healthcare) equilibrated with buffer H300 
(5 mM HEPES-KOH, 300 mM KCl, 5% glycerol, pH 7.1). eIF1 and eIF1A were 
frozen in liquid nitrogen in small aliquots and stored at —80 °C. 

Initiator tRNA. Mammalian initiator tRNA was transcribed using T7 RNA 
polymerase and chemically synthesized DNA oligonucleotides as suggested in 
ref. 42. tRNA was extracted by a phenol chloroform mixture (5:1, pH 4.3) and 
precipitated by ethanol. tRNA was dissolved in buffer (10 mM Tris-HCl, 90 mM 
KCl, 3.0 mM Mg acetate, 0.5 mM EDTA, pH 7.4) and purified on a Superdex 75 
column (GE Healthcare), equilibrated with the same buffer. tRNA was not 
aminoacylated. tRNA was concentrated to 2.9 mg ml⁻¹, frozen in liquid nitrogen in 
small aliquots and stored at −80 °C. 

mRNA. A 24-nucleotide-long mRNA with an AUG start codon was chemically 
synthesized by Thermo Fisher Scientific; the sequence was 
5′-CAACAACAACAACAAAUGUUUCAA-3′. 
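
The position numbering used for the start-codon context in the main text (−3 and +4) can be illustrated on this sequence. The short sketch below assumes the usual convention, which is not spelled out here, that the A of the AUG is +1 and that there is no position 0; the function name `context_positions` is hypothetical and is introduced only for illustration.

```python
def context_positions(mrna, codon="AUG"):
    """Return (index of the start codon, nucleotide at -3, nucleotide at +4).

    Assumed convention: the A of AUG is +1 and there is no position 0,
    so -3 lies three nucleotides upstream of the A and +4 directly follows the G.
    """
    start = mrna.find(codon)
    if start < 0:
        raise ValueError("no start codon found")
    if start < 3 or start + 3 >= len(mrna):
        raise ValueError("sequence too short to contain positions -3 and +4")
    return start, mrna[start - 3], mrna[start + 3]


# The 24-nucleotide mRNA given above.
MRNA = "CAACAACAACAACAAAUGUUUCAA"
start, minus3, plus4 = context_positions(MRNA)
print(f"AUG starts at index {start}; -3 = {minus3}, +4 = {plus4}")
```

For this mRNA the AUG starts at the sixteenth nucleotide, so positions −3 and +4 are the thirteenth and nineteenth nucleotides, respectively.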

Purification of 40S ribosomal subunits from rabbit reticulocyte lysate (RRL). 
Frozen RRL was purchased from Green Hectares. The following components 
were added to 1 l of thawed RRL: 5 mM HEPES-KOH (pH 7.1), 15 mM KCl, 
11 mM Mg acetate, 1.0 mM EDTA (pH 8.0), 2 mM DTT, 10 µM MG132, 
500 mg heparin, 150 mg Pefabloc, 12 tablets of Complete Protease Inhibitor 
Cocktail (Roche). Ribosomes were pelleted by centrifugation at 40,000 r.p.m. 
for 10.5 h in a Beckman Type 45 Ti rotor at 4 °C through a sucrose cushion 
(7 ml/tube; 30% (w/v) sucrose, 20 mM Bis-Tris (pH 5.9), 200 mM NH4 acetate, 
300 mM KCl, 10 mM Mg acetate, 5 mM DTT). The pellet was re-suspended 
in 25 ml of buffer A (20 mM HEPES-KOH, 30 mM KCl, 11 mM Mg acetate, 
1 mM EDTA, 2 mM DTT, 10 µM MG132, 1.0 mg ml⁻¹ heparin, 0.2 mg ml⁻¹ 
Pefabloc, 1 tablet/100 ml Complete Protease Inhibitor Cocktail (Roche), 200 U ml⁻¹ 
SUPERaseIN (Ambion), pH 7.1). Subsequently, NH4Cl was added to the 
suspension to a final concentration of 0.4 M and the suspension was mixed on ice for at 
least 0.3 h. One-third of the suspension was then clarified by centrifugation in an 
Eppendorf microcentrifuge for 10 min at 13,200 r.p.m. at 4°C and loaded on a 
Sephacryl S-400 HR column (GE Healthcare) equilibrated with buffer B (20 mM 
Tris-HCl, 500 mM KCl, 10 mM MgCl2, 5% (w/v) glycerol, 2 mM DTT, pH 7.4). 
This step was repeated for the rest of the ribosomal suspension. Fractions (8 ml) 
were collected and analysed by SDS-PAGE (4-12%, MES SDS running buffer, 
NuPAGE). Those containing 80S ribosomes were combined, brought to a final 
concentration of 2.5 mM MgCl2 and 1% glycerol and concentrated to a final 
volume of 25 ml. Optical density of concentrated ribosomes (OD260 nm) should 
be 200–350 U ml⁻¹. Ribosomes were subsequently supplemented with 1 mM 
puromycin and incubated for 20 min at 37 °C followed by 20 min incubation at room 
temperature and then chilled on ice for at least 10 min. 

Chilled ribosomes were clarified by centrifugation in an Eppendorf microcen- 
trifuge for 10 min at 13,200 r.p.m. at 4 °C and loaded on a 10-40% sucrose density 
gradient, prepared in tubes for Beckman SW 32 Ti rotor (10-40% (w/v) sucrose, 
20 mM Tris-HCl, 500 mM KCl, 5mM Mg acetate, 2mM DTT, pH 7.4). Sucrose 
density gradient centrifugation was performed at 25,000r.p.m. for 14.5h in a 
Beckman SW 32 Ti rotor. Fractions (1 ml) were collected and analysed by 
SDS-PAGE (4-12%, MES SDS running buffer, NuPAGE). Fractions containing 
40S subunits were combined, diluted five times with buffer B without KCl and 
loaded on a DEAE column equilibrated in buffer B with 50 mM KCl. Elution was 
performed by a 0.05–1.0 M KCl gradient in buffer B. Fractions containing 40S 
subunits were combined, concentrated to 391 U ml⁻¹ (OD260 nm) and dialysed 
against buffer C (10 mM HEPES-KOH, 30 mM KCl, 5 mM Mg acetate, 5% (w/v) 
glycerol, 2mM DTT, pH7.1). Small aliquots were frozen in liquid nitrogen and 
stored at —80 °C. 

Activity test. Purified 40S ribosomal subunits were tested by toeprinting analysis 
for their ability to form initiation complexes with mRNA containing CrPV IRES, 
as described previously*’. 

Complex formation. Before crystallization, PIC1 complex was formed by mixing 
ribosomes and eIF1 to final concentrations of 2.6 µM and 9.1 µM, respectively, with 
subsequent dialysis against buffer DB (2.0 mM HEPES-KOH, 100 mM NH4Cl, 
15 mM KCl, 3 mM Mg acetate, 2 mM TCEP, pH 7.1) for 2 h at room temperature. 


The complex was incubated at 37 °C for 15 min and kept at room temperature for 
crystallization setup. Complex PIC2 was formed by soaking eIF1A (final 
concentration 40 µM) into crystals of PIC1 during freezing. The 48S PIC was formed 
similarly to PIC1 but with the addition of eIF1A (final concentration 7.5 µM), 
mRNA (10 µM) and tRNAi (10 µM). Although the presence of eIF1 in the reaction 
mixture is essential for crystallization, this factor is not seen in the 48S PIC structure. 
We suggest that it promotes the homogeneity of the complex, which is required for 
crystal growth, by controlling the correct recognition of the initiation codon by 
tRNAi and by dissociating aberrantly assembled ribosomal complexes****. 

Crystallization and cryoprotection. Crystals were grown in 24-well sitting-drop 
plates using the vapour diffusion technique. Four microlitres of the PIC1 were 
mixed with 4 µl of reservoir solution (50 mM Tris-HCl, 100 mM NH4 acetate, 
5 mM Mg acetate, 50 mM sarcosine, 5 mM urea, 2.5% PEG 20000, 5% MPD, 
2 mM TCEP, pH 8.5). Plates were incubated at 19 °C for 6–9 days. Crystals were 
stabilized by stepwise increase of MPD to the final concentration of 35%. After 
stabilization, crystals were frozen in a cryo stream. For PIC2 complex formation, 
crystals were stabilized in the same buffer containing 40 µM of eIF1A. 

48S PIC (4 µl) was mixed with 7.4 µl of reservoir solution (50 mM Tris-HCl, 
100 mM NH4 acetate, 5 mM Mg acetate, 50 µM Zn acetate, 2 mM putrescine, 
2.5% PEG 20000, 5% 1,4-butanediol, 2 mM TCEP, pH 8.5). Plates were incubated 
at 19 °C for 6–13 days. Crystals were stabilized at 4 °C by stepwise increase of 
1,4-butanediol and PEG 20000 to the final concentration of 30% and 3.0%, 
respectively. After stabilization, crystals were frozen in liquid nitrogen. 

Data collection and processing. Data collection was carried out at 100 K at 
beamlines X25 and X29 at Brookhaven National Laboratory and 24ID at the 
Advanced Photon Source. A complete data set was collected from a single crystal 
(PIC1, PIC2) or from two crystals (48S PIC, Supplementary Table 1). Data were 
integrated and scaled with the XDS program package”. All crystals belong to the 
hexagonal space group P3,21 (Supplementary Table 1). 

The structure was solved by molecular replacement using PHASER from the 
CCP4 program suite’. The search model was generated from the previously 
published structure of yeast 80S ribosome (ref. 28, Protein Data Bank entries 
3U5B and 3U5C). Both the 60S subunit and the suppressor protein STM1 were 
excluded from the search model. Before refinement all B factors in the model were 
set to an isotropic B of 80. The initial molecular replacement solution was refined 
by rigid-body refinement with the 40S subunit split into multiple domains 
(Supplementary Table 2), followed by five cycles of grouped TLS and grouped 
B-factor refinement using PHENIX”**. Bulk solvent correction was applied as 
recommended”. After initial refinement, the difference electron density corres- 
ponding to each initiation factor, mRNA and tRNA became clearly visible in the 
difference electron density map (Fo − Fc, Fig. 1). The NMR structures of human 
eIF1 and eIF1A (refs 36, 26, PDB entries 2IF1 and 1D7Q) without their flexible 
tails were docked into the corresponding electron density, followed by refinement 
using PHENIX“*. Structures of the mRNA and P site tRNA were taken from the 
previously published structure of the 70S ribosome with bound mRNA and 
tRNAs (ref. 13, PDB entry 2J00), docked into the corresponding electron 
density in the 48S PIC map and refined as above. The position of the mRNA (except 
AUG) was manually adjusted to fit the electron density before its refinement in 
PHENIX**. mRNA and tRNA were treated as one rigid body during refinement. 

Although structures cannot be built de novo at a resolution of about 8–11 Å, 
the availability of high-resolution models allows the detection of 
movements of the structural domains of the ribosome and the location of 
the bound protein factors and even antibiotics’. The conformational changes 
of the 40S subunit described in our manuscript are reproducible during refine- 
ment independently of the starting model: for example, refining the 40S structure 
from PIC1 against the data of PIC2 results in conformation of PIC2, whereas 
refining the 40S structure from PIC2 against the data of PIC1 results in con- 
formation of PIC1. That assures us that movements of the domains of the 40S 
subunit that were observed are not biased to the starting models. 

When the manuscript was under review the high-resolution cryo-electron 
microscopy structure of the human 80S ribosome was published”. Because of 
the high identity between rabbit and human ribosomal proteins and 18S rRNA 
(Supplementary Table 3) we used the structure of the human 40S subunit for the 
final refinement. 

Figures. Figures showing electron densities and atomic models were generated 
using PYMOL (The PyMOL Molecular Graphics System, http://www.pymol. 
org). 
A variable absorption feature in the X-ray spectrum 

Soft-γ-ray repeaters (SGRs) and anomalous X-ray pulsars (AXPs) are 
slowly rotating, isolated neutron stars that sporadically undergo epi- 
sodes of long-term flux enhancement (outbursts) generally accompan- 
ied by the emission of short bursts of hard X-rays’”. This behaviour can 
be understood in the magnetar model*°, according to which these 
sources are mainly powered by their own magnetic energy. This is 
supported by the fact that the magnetic fields inferred from several 
observed properties” * of SGRs and AXPs are greater than—or at the 
high end of the range of—those of radio pulsars. In the peculiar case of 
SGR 0418+5729, a weak dipole magnetic moment is derived from its 
timing parameters’, whereas a strong field has been proposed to reside 
in the stellar interior'®” and in multipole components on the surface’. 
Here we show that the X-ray spectrum of SGR 0418+5729 has an 
absorption line, the properties of which depend strongly on the star’s 
rotational phase. This line is interpreted as a proton cyclotron feature 
and its energy implies a magnetic field ranging from 2 × 10¹⁴ gauss to 
more than 10¹⁵ gauss. 

On 2009 June 5 two short bursts of hard X-rays, detected by Fermi 
and other satellites, revealed the previously unknown source SGR 
0418+5729"°. Subsequent observations with the Rossi X-ray Timing 
Explorer (RXTE), Swift, Chandra and X-ray Multi-mirror Mission 
(XMM) Newton satellites found the new SGR to be an X-ray pulsar 
with a period of ~9.1 s and a luminosity of ~1.6 × 10³⁴ erg s⁻¹ (in the 
0.5-10 keV band and for a distance of 2 kpc)'*"*. During the three 
years after the onset of the outburst, the spectrum softened and the 
luminosity declined by three orders of magnitude, but remained still 
too high to be powered by rotational energy”’®"*. The measured spin-
down rate of 4 × 10⁻¹⁵ s s⁻¹ translates (under the assumption of a rotating 
magnetic dipole in vacuo) into a magnetic field B = 6 × 10¹² G at the 
magnetic equator’, a value well in the range of normal radio pulsars. 
However, the presence of high-order multipolar field components of 
10¹⁴ G close to the surface has been invoked to interpret the spectrum of 
the source in the framework of atmosphere models’. In any case, a 
strong crustal magnetic field (>10¹⁴ G) seems to be required to explain 
the overall properties of SGR 0418+5729 within the magnetar model”"'. 
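
As a rough check on the dipole field quoted above, the sketch below applies the standard vacuum-dipole spin-down estimate, B ≈ 3.2 × 10¹⁹ (P Ṗ)^½ G. The numerical coefficient is an assumption not stated in the text: it corresponds to a canonical neutron star with a moment of inertia of 10⁴⁵ g cm² and a radius of 10 km. The period is the value quoted in the Fig. 2 legend, and the function name is hypothetical.

```python
import math


def dipole_field_gauss(period_s, period_dot):
    """Equatorial dipole field (gauss) from the vacuum-dipole spin-down estimate.

    The 3.2e19 coefficient assumes a canonical neutron star
    (moment of inertia 1e45 g cm^2, radius 10 km); it is not given in the text.
    """
    return 3.2e19 * math.sqrt(period_s * period_dot)


# Spin period from the Fig. 2 legend and the spin-down rate quoted above.
b_dipole = dipole_field_gauss(9.07838827, 4e-15)
print(f"B ~ {b_dipole:.1e} G")
```

Under these assumptions the estimate comes out close to 6 × 10¹² G, in line with the value quoted above.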

Hints of the presence of an absorption feature at 2 keV in the spec- 
trum of SGR 0418+5729 were found in the phase-resolved analysis of 
data (with relatively low-count statistics) from the Swift X-ray Tele- 
scope (XRT) taken during 2009 July 12-16"*. Thanks to the large col- 
lecting area and good spectral resolution of the European Photon 
Imaging Camera (EPIC), we were able to perform a more detailed 
investigation using data collected by XMM-Newton during a 67-ks long 
observation performed on 2009 August 12, when the source flux was 
still high (5 × 10⁻¹² erg cm⁻² s⁻¹ in the 2–10 keV band). 

To examine the spectral variations as a function of the star’s rota- 
tional phase without making assumptions about the X-ray spectral 
energy distribution of SGR 0418+5729, we produced a phase-energy 
image by binning the EPIC source counts into energy and rotational 


phase channels and then normalizing to the phase-averaged energy spec- 
trum and pulse profile. The normalized phase—-energy image (Fig. 1) 
shows a prominent V-shaped feature in the phase interval ~0.1-0.3. 
This is produced by a lack of counts in a narrow energy range with respect 
to nearby energy channels, that is, an absorption feature at a phase- 
dependent energy. The regular shape of the feature in the phase-energy 
plane as well as its presence in the three independent EPIC detectors (see 
Supplementary Fig. 5) exclude the possibility that it results from statistical 
fluctuations in the number of counts or from an instrumental effect. 
Another absorption feature is visible at low energies at phase ~0.5-0.6. 
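
The construction of the normalized phase–energy image can be sketched as follows. This is only an illustration of the binning and normalization described above, not the analysis code used for the EPIC data: the event list format (arrival time in seconds, energy in keV), the function name and its parameters are all assumptions. Dividing each cell first by the mean counts in its energy bin and then by the relative count rate in its phase bin is algebraically the same as multiplying the cell counts by the total counts and dividing by the product of the two marginal sums, which is what the code does.

```python
def phase_energy_image(events, period, n_phase, e_min, e_max, e_width):
    """Build a normalized phase-energy image from (time_s, energy_keV) events.

    A value of 1 means the cell contains exactly the counts expected from the
    phase-averaged spectrum and the pulse profile; values below 1 mark a deficit.
    """
    n_energy = int(round((e_max - e_min) / e_width))
    counts = [[0] * n_energy for _ in range(n_phase)]
    for t, e in events:
        if not (e_min <= e < e_max):
            continue  # event outside the selected energy band
        i = int(((t / period) % 1.0) * n_phase) % n_phase  # rotational phase bin
        j = min(int((e - e_min) / e_width), n_energy - 1)  # energy channel
        counts[i][j] += 1

    total = sum(sum(row) for row in counts)
    per_phase = [sum(row) for row in counts]  # pulse profile
    per_energy = [sum(counts[i][j] for i in range(n_phase))
                  for j in range(n_energy)]  # phase-summed spectrum

    image = [[0.0] * n_energy for _ in range(n_phase)]
    for i in range(n_phase):
        for j in range(n_energy):
            if per_phase[i] and per_energy[j]:
                image[i][j] = counts[i][j] * total / (per_phase[i] * per_energy[j])
    return image


# Toy example: one event in every cell of a 4 x 2 grid gives a flat image.
period = 2.0
events = [((k + 0.5) / 4 * period, e) for k in range(4) for e in (0.5, 1.5)]
flat = phase_energy_image(events, period, 4, 0.0, 2.0, 1.0)
# Removing one event leaves a deficit in the corresponding cell.
dipped = phase_energy_image(events[1:], period, 4, 0.0, 2.0, 1.0)
```

In an image built this way, a phase-dependent absorption line appears as a connected track of cells with values below 1.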

We extracted from the EPIC data the phase-averaged spectrum of 
SGR 0418+5729, as well as the spectra from 50 phase intervals of width 
0.02 rotational cycles, as described in the Supplementary Information. 
The phase-averaged spectrum can be adequately fitted by either a two-
blackbody model (χν² = 1.198 for 196 degrees of freedom, d.f.) or a 
blackbody plus power-law model (χν² = 1.105 for 196 d.f.), corrected 
for interstellar absorption (see refs 11 and 12 for other models that can 
fit the spectrum). 

The 15 spectra extracted from the phase intervals 0.1-0.3 and 0.5-0.6, 
unlike those of the remaining phases, cannot be fitted by a renorma- 
lization of the phase-averaged best-fit model, which gives in most cases 
null hypothesis probabilities in the range 107 *-107° (see Supplemen- 
tary Fig. 4). They are instead well fitted (null hypothesis probability 
>0.03) by the addition of a narrow absorption line component, which 
can be equally well modelled with a Gaussian profile or a cyclotron 
absorption line model’* (the improvement obtained by adding a cyclo- 
tron component in the phase intervals 0.1-0.3 and 0.5-0.6 can be seen 
in Supplementary Fig. 4). The best-fit line parameters as a function of 
phase are shown in Fig. 2 and an example of a phase-resolved spectrum 
is displayed in Fig. 3. 
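
The shape of the cyclotron absorption component can be illustrated with a short sketch. It assumes that the line profile given in the Fig. 2 legend reads F(E) = exp[−D (W E/Ec)² / ((E − Ec)² + W²)], with line energy Ec, width W and depth D; the parameter values used below are purely illustrative and are not fitted values from this work.

```python
import math


def cyclotron_absorption(energy, line_energy, width, depth):
    """Multiplicative absorption factor F(E) for a cyclotron line.

    Assumed form: exp(-D * (W * E / Ec)**2 / ((E - Ec)**2 + W**2)).
    All energies are in the same units (for example keV).
    """
    numerator = (width * energy / line_energy) ** 2
    denominator = (energy - line_energy) ** 2 + width ** 2
    return math.exp(-depth * numerator / denominator)


# Illustrative parameters only: Ec = 2 keV, W = 0.5 keV, D = 1.2.
at_centre = cyclotron_absorption(2.0, 2.0, 0.5, 1.2)
off_line = cyclotron_absorption(10.0, 2.0, 0.5, 1.2)
```

At the line centre the factor reduces to exp(−D), so D directly sets the maximum fractional absorption, whereas W controls how quickly the factor returns towards 1 away from Ec.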

We searched for the phase-dependent absorption feature in all the 
available X-ray observations of SGR 0418+5729 and found that it was 
present in the phase interval 0-0.3, and up to higher energies than in 
XMM-Newton, in RXTE data taken during the first two months of the 
outburst (see Supplementary Fig. 6). 

Absorption features have been observed in the X-ray spectra of 
various classes of neutron stars'°*’ and interpreted as being due to 
either cyclotron absorption (by electrons or protons) or bound—bound 
atomic transitions. However, variations in the line energy as a function 
of the rotational phase as large as in SGR 0418+5729 (by a factor of ~5 
in one-tenth of a cycle) have not been seen in any source. 

In a neutron star atmosphere, different atomic transitions might be 
responsible for a phase-variable absorption feature if temperature, ele- 
mental abundance or magnetic field vary strongly on the surface. The 
line energies observed in SGR 0418+5729 (~1-5 keV) rule out transi- 
tions in magnetized H and He, which occur below ~1 keV (refs 24, 25). 
On the other hand, the absorption spectra of heavier elements are much 
more complex (see, for example, ref. 26 for C, O and Ne) and some lines 
could occur at high enough energies. However, to explain the 
phase-resolved spectra of SGR 0418+5729, the physical conditions of a 
heavy-element atmosphere are forced to vary in such a way that a single 
transition should dominate the opacity at each of the phases where the 
absorption line is detected. 

A more straightforward explanation for the line variability can 
instead be given if the feature is due to cyclotron resonant scattering. 
Figure 2 | Results of the phase-resolved spectroscopy of SGR 0418+5729. 
a, Pulse profile obtained by folding the 0.3–10 keV EPIC positive–negative 
junction (pn) detector light curve at the neutron star spin period P = 9.07838827 s. 
The data points are the number of counts in each phase-dependent spectrum. 
b–d, Line energy (Ec; b), width (W; c) and depth (D; d) of the cyclotron feature as a 
function of the spin phase. The model consists of a blackbody plus a power law and 
an absorption line, modified for the interstellar absorption (see Supplementary 
Information). For the line we used the cyclotron absorption model from ref. 15: 

$$F(E) = \exp\left(-D\,\frac{(W E/E_c)^2}{(E - E_c)^2 + W^2}\right)$$

The absorption column density, blackbody temperature, 
photon index and relative normalizations of the two components were fixed to the 
best-fit values of the phase-integrated spectrum: NH = (9.6 ± 0.5) × 10²¹ cm⁻², 
kT = 913 ± 8 eV, Γ = 2.8 ± 0.2, (RBB/d)² = 0.81 ± 0.03 km² kpc⁻² and KPL = 
(1.5 ± 0.2) × 10⁻³ photons cm⁻² s⁻¹ keV⁻¹ at 1 keV. Vertical error bars, 1 s.d. 
Figure 1 | Phase-dependent spectral feature in the EPIC data of 
SGR 0418+5729. Normalized energy versus phase image obtained by binning the 
EPIC source counts into 100 phase bins and 100-eV-wide energy channels and 
dividing these values first by the average number of counts in the same 
energy bin (corresponding to the phase-averaged energy spectrum) and then by 
the relative 0.3–10 keV count rate in the same phase interval 
(corresponding to the pulse profile normalized to the average count rate). 
The red line shows (for only one of the two displayed cycles) the results of a 
simple proton cyclotron model consisting of a baryon-loaded plasma loop 
emerging from the surface of a magnetar and intercepting the X-ray 
radiation from a small hotspot (see Supplementary Fig. 7 and 
Supplementary Table 1). 


The cyclotron energy (in keV) for a particle of charge e and mass m in 
magnetic field B (in gauss) is given by 


$$E_{\rm cycl} = \frac{11.6}{1+z} \left( \frac{m_e}{m} \right) \left( \frac{B}{10^{12}} \right)$$

where (1 + z)⁻¹ = [1 − 2GM_NS/(Rc²)]^(1/2) (which is ~0.8 at the star
surface for typical neutron star mass and radius M_NS = 1.4 M_⊙ and
R_NS = 12 km, respectively) accounts for the gravitational redshift at
distance R from the neutron star centre, m_e is the mass of the electron,
and c is the velocity of light. In this case, the phase variability of the
feature energy would simply be due to the different fields experienced 
by the charged particles interacting with the photons directed towards 
us as the neutron star rotates. 
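
As a rough numerical check of the cyclotron-energy relation above, the following minimal Python sketch evaluates it for electrons and protons. The physical constants (rounded CGS values), the function names and the default radius of 12 km are assumptions made for illustration; they are not taken from the paper's analysis. Without the redshift factor, 11.6 × 6 ≈ 70 keV for electrons in a 6 × 10¹² G field; including the surface redshift lowers this by a factor of about 0.8.

```python
import math

# Rounded CGS constants (assumed values, for illustration only)
G = 6.674e-8        # gravitational constant, cm^3 g^-1 s^-2
C = 2.998e10        # speed of light, cm s^-1
M_SUN = 1.989e33    # solar mass, g
M_E = 9.109e-28     # electron mass, g
M_P = 1.673e-24     # proton mass, g


def redshift_factor(r_cm, m_ns=1.4 * M_SUN):
    """Return 1/(1+z) = sqrt(1 - 2GM/(R c^2)) at distance r_cm from the star centre."""
    return math.sqrt(1.0 - 2.0 * G * m_ns / (r_cm * C ** 2))


def cyclotron_energy_kev(b_gauss, mass_g, r_cm=12.0e5):
    """Cyclotron energy in keV for a particle of mass mass_g in a field b_gauss."""
    return 11.6 * redshift_factor(r_cm) * (M_E / mass_g) * (b_gauss / 1.0e12)


if __name__ == "__main__":
    print("1/(1+z) at the surface:", redshift_factor(12.0e5))
    print("electrons, B = 6e12 G :", cyclotron_energy_kev(6.0e12, M_E), "keV")
    print("protons,   B = 2e14 G :", cyclotron_energy_kev(2.0e14, M_P), "keV")
```

The proton case illustrates why a field above roughly 2 × 10¹⁴ G is needed to bring a proton cyclotron line into the keV range.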

If the absorbers and scatterers are electrons hovering near the star 
surface, the expected line energy is ~70 keV for the dipole field at the
equator of SGR 0418+5729 (B = 6 × 10¹² G); this line energy is more
than 10 times higher than that observed. A possible way to explain this
large discrepancy might be to assume that the electrons producing the
line are located higher up in the magnetosphere in a dipolar geometry,
where the magnetic field is smaller (R ≈ 3R_NS to have E_cycl ≈ 2 keV).
Moreover, such an electron population should also be nearly mono- 
energetic, or subrelativistic, in order to prevent Compton scattering 
from washing out the feature, which would require a mechanism to 
maintain slowly moving electrons confined in a small volume high in 
the magnetosphere. 

If the particles responsible for the cyclotron scattering are protons, 
the energy range of the SGR 0418+5729 spectral feature requires a 
magnetic field >2 × 10¹⁴ G (it would be even larger for heavier ions).
In the framework of the magnetar model, the unprecedented phase- 
variability of the line energy can be explained by the complex topology 
of magnetar magnetospheres, in which global and/or localized twists 
play an important part. This is particularly true for SGR 0418+5729,
which has a weak dipolar component, as testified by the small spin-
down value, whereas a much stronger internal magnetic field has
been advocated to explain its X-ray luminosity and burst activity.
Furthermore, the presence of small-scale, strong, multipolar compo-
nents of the surface field has been inferred by fitting its phase-averaged
X-ray spectrum with models of magnetized neutron star atmospheres.

In this context, the observed line variability might be due to the 
presence of either strong magnetic field gradients along the surface 
or vertical structures (with a spatially dependent field) emerging from 
the surface. To work out how the dynamic magnetosphere of a mag- 
netar should look, an analogy with the solar corona in the proximity of 
sunspots has been proposed (see, for example, ref. 27). In particular, 
localized, baryon-rich magnetic structures (in the form of rising flux 
tubes) or ‘prominences’ (produced by magnetic reconnection or the 
emergence of the internal field near a crustal fault) have been proposed 
to explain some of the observed properties of the giant flare emitted 
Figure 3 | Example of a phase-resolved EPIC pn spectrum and its residuals
with respect to different models. a, Spectrum from the phase interval 0.15-
0.17 (black dots) and best-fit model of the phase-averaged spectrum, rescaled
with a free normalization factor (red line). b, Residuals with respect to this
model (χ²_ν = 2.75 for 25 d.f.); c, residuals after the addition of an absorption line
(cyclabs model in XSPEC, with parameters as in Fig. 2; χ²_ν = 0.94 for 22 d.f.);
d, residuals with respect to an absorbed blackbody plus power-law model with
free temperature, photon index and normalizations (kT = 1.11 ± 0.06 keV and
Γ = 3.8 ± 0.4; χ²_ν = 1.75 for 22 d.f.). This is one of the models (with the same
number of free parameters) that we also explored to fit the phase-resolved
spectra. In this case, we obtained fits of comparable quality to those with the line
model at most phases, but worse fits in the phase interval 0.11-0.21. A joint fit
to these five spectra gave an unacceptable χ²_ν of 1.56 for 116 d.f., to be compared
with χ²_ν of 1.08 for the absorption line model (same number of d.f.). Horizontal
error bars indicate the energy channel width; vertical error bars, 1 s.d.; residuals
are in units of standard deviations.


in 2004 by SGR 1806-20. If a similar scenario, albeit on a reduced
scale, occurred during the outburst of SGR 0418+5729, a spectral
feature might arise as thermal photons from the hotspot (a small
hot region on the neutron star surface, responsible for most of the
X-ray emission, which could be itself related to the prominence) cross
the plasma threading the magnetic loop. A proton density ~10¹⁷ cm⁻³
is needed to produce a resonant scattering depth of order unity.
Protons, being heavy, do not rise much above the surface and move
subrelativistically, so resonant scattering in the prominence is likely to
produce a narrow feature instead of an extended tail. As the star rotates, 
photons emitted in different directions pass through portions of the 
prominence with different magnetic field, density and size, giving rise 
to the observed variations of the line centroid and width. A simple 
quantitative model based on this picture is presented in Supplementary 
Information. Results, obtained with a geometry consistent with the 
constraints derived from the X-ray pulsed fraction of SGR 0418+5729,
are in good agreement with the observed variations of the feature
with phase (Fig. 1). 
Deterministic quantum teleportation of photonic
quantum bits by a hybrid technique 


Shuntaro Takeda, Takahiro Mizuta, Maria Fuwa, Peter van Loock & Akira Furusawa


Quantum teleportation’ allows for the transfer of arbitrary unknown 
quantum states from a sender to a spatially distant receiver, pro- 
vided that the two parties share an entangled state and can commu- 
nicate classically. It is the essence of many sophisticated protocols for 
quantum communication and computation’ °. Photons are an opti- 
mal choice for carrying information in the form of ‘flying qubits’, 
but the teleportation of photonic quantum bits® ” (qubits) has been 
limited by experimental inefficiencies and restrictions. Main disad- 
vantages include the fundamentally probabilistic nature of linear- 
optics Bell measurements’’, as well as the need either to destroy the 
teleported qubit or attenuate the input qubit when the detectors do 
not resolve photon numbers'’. Here we experimentally realize fully 
deterministic quantum teleportation of photonic qubits without post- 
selection. The key step is to make use of a hybrid technique involving
continuous-variable teleportation’* *° of a discrete-variable, photo- 
nic qubit. When the receiver’s feedforward gain is optimally tuned, 
the continuous-variable teleporter acts as a pure loss channel'”"*, 
and the input dual-rail-encoded qubit, based on a single photon, represents
a quantum error detection code against photon loss and hence
remains completely intact for most teleportation events. This allows 
for a faithful qubit transfer even with imperfect continuous-variable 
entangled states: for four qubits the overall transfer fidelities range 
from 0.79 to 0.82 and all of them exceed the classical limit of tele- 
portation. Furthermore, even for a relatively low level of the entangle- 
ment, qubits are teleported much more efficiently than in previous 
experiments, albeit post-selectively (taking into account only the 
qubit subspaces), and with a fidelity comparable to the previously 
reported values. 

Since originally being proposed’, the concept of quantum teleporta- 
tion has attracted a lot of attention and has become one of the central 
elements of advanced and practical realizations of quantum information 
protocols. It is essential for long-distance quantum communication by 
means of quantum repeaters’, and it has been shown to be a useful tool 
for realizing universal quantum logic gates in a measurement-based 
fashion®. Many proposals and models for quantum computation rely 
on quantum teleportation, such as the efficient linear-optics quantum 
computing scheme of ref. 4 and the ‘one-way’ quantum computer using 
cluster states”. 

Although much progress has been made in demonstrating quantum 
teleportation of photonic qubits*”, most of these schemes share one fun- 
damental restriction: an unambiguous two-qubit Bell-state measurement 
(BSM), as is needed to teleport a qubit using two-qubit entanglement, 
is always probabilistic when linear optics is used, even if photon-number- 
resolving detectors are available'*’’. There are two experiments avoid- 
ing this constraint, but in these either a qubit can no longer be teleported 
when it is delivered independently from an external resource’ or an 
extra nonlinear element leads to extremely low measurement efficiencies,
of the order of 10⁻¹⁰ (ref. 8). A further experimental limitation,
rendering these schemes fairly inefficient, is the probabilistic nature of 
the entangled resource states’’. Efficient, near-deterministic quantum 
teleportation, however, is of great benefit in quantum communication, 


where it can be used to reduce the storage times of quantum memories 
in a quantum repeater, and it is a necessity in teleportation-based 
quantum computation. An additional drawback of the previous 
experiments, owing to the lack of photon-number-resolving detectors, 
was the need either to destroy the teleported qubit” or to attenuate the 
input qubit’’, thus further decreasing the success rate of teleportation. 
We overcome all the above limitations by taking a different approach: 
continuous-variable quantum teleportation of a photonic qubit. The 
strength of continuous-variable teleportation lies in the on-demand
availability of the quadrature-entangled states and the completeness of a BSM
in the quadrature bases using linear optics and homodyne detections.
So far, these tools have been used to unconditionally teleport continuous- 
variable quantum states such as coherent states'®”'. However, it has not 
yet been possible to apply them to qubits'*”’, because typical pulsed- 
laser-based qubits (such as those in refs 6-11) have a broad frequency 
bandwidth that is incompatible with the original continuous-wave-based 
continuous-variable teleporter, which works only on narrow sidebands'*”’. 
We overcome this incompatibility by using very recent, advanced tech- 
nology: a broadband continuous-variable teleporter” and a narrowband 
time-bin qubit compatible with that teleporter™*. Importantly, this qubit 
uses two temporal modes to represent a dual-rail-encoded qubit'* 


$$|\psi\rangle = \alpha|0,1\rangle + \beta|1,0\rangle$$


where |0, 1⟩ and |1, 0⟩ refer to the temporal modes of the photon
(expressed in the two-mode photon-number basis, with |α|² + |β|² = 1).
Therefore, teleportation of both modes of the qubit is accomplished by 
means of a single continuous-variable teleporter acting subsequently 
on the temporal modes of the time-bin qubits (Fig. 1). 

Remarkably, the main weakness of continuous-variable teleportation,
namely the intrinsic imperfection of the finitely squeezed, entangled
states, can be circumvented to a great extent in the present ‘hybrid’
setting when the input to the continuous-variable teleporter is a dual-
rail qubit. The entangled state of the teleporter is a two-mode squeezed,
quadrature-entangled state, $\sqrt{1-g_{\rm opt}^2}\,\sum_{n=0}^{\infty} g_{\rm opt}^n |n,n\rangle$, here written
in the number basis with g_opt = tanh(r), where r is the squeezing parameter.
Because infinite squeezing (r → ∞) requires infinite energy, maximally
entangled states are physically unattainable; thus, the teleportation
variable quantum teleportation protocol with unit gain for the receiver’s 
feedforward displacement” yields a largely distorted output qubit with 
additional thermal photons. 

In contrast, non-unit gain conditions are useful in some cases. In
particular, a single-mode continuous-variable teleporter with gain g_opt
creates no additional photons, because it is equivalent to a pure attenuation
channel from which an intensity fraction of 1 − g_opt² is lost to the
environment. Moreover, the dual-rail-encoded qubit represents a
quantum error detection code against such amplitude damping, where 
either a photon-loss error occurs, erasing the qubit, or a symmetric 
damping leaves the input qubit state completely intact’’. These two 
facts together mean that the dual-rail continuous-variable teleporter at 
Figure 1 | Experimental set-up. A time-bin qubit is heralded by detecting one
half of an entangled photon pair produced by an optical parametric oscillator 
(OPO). The continuous-variable teleporter (g, feedforward gain) always 

transfers this qubit, albeit with non-unit fidelity. The teleported qubit is finally 


optimal gain, g_opt, transforms the initial qubit state as follows:


$$|\psi\rangle\langle\psi| \rightarrow g_{\rm opt}^2\,|\psi\rangle\langle\psi| + (1 - g_{\rm opt}^2)\,|0,0\rangle\langle 0,0| \qquad (1)$$


Most importantly, no additional photons are created and the quantum
information encoded in |ψ⟩ remains undisturbed regardless of the
squeezing level. The only effect of the teleporter is the extra two-mode
vacuum term, whose fraction becomes arbitrarily small for sufficiently
large squeezing, g_opt → 1. This technique allows us to teleport arbitrary
qubit states more faithfully by suppressing additional photons, thereby
realizing unconditional teleportation with a moderate level of squeezing.
Equation (1) also shows that a fidelity of unity is obtainable for any
non-zero squeezing level, g_opt > 0, provided that the signal qubit subspace
is post-selected, that is, the non-occurrence of a photon-loss error
is detected with a probability approaching zero for g_opt → 0. We note
that the remaining vacuum contribution could be made arbitrarily small
without post-selection of the final states, by instead immediately discarding
all quadrature results of the BSM that are too far from the phase-
space origin.
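
The input-output relation of equation (1) can be checked numerically for the pure-attenuation channel alone. The sketch below is not a simulation of the continuous-variable teleporter; it only applies an independent loss channel with intensity transmission g² to each of the two temporal modes of a dual-rail qubit, truncated to at most one photon per mode. The Kraus-operator construction, the basis ordering and all function names are assumptions chosen for illustration.

```python
import numpy as np


def loss_kraus(t):
    """Kraus operators of a single-mode loss channel (intensity transmission t)
    restricted to the Fock states |0> and |1>."""
    k0 = np.array([[1.0, 0.0], [0.0, np.sqrt(t)]], dtype=complex)        # photon survives
    k1 = np.array([[0.0, np.sqrt(1.0 - t)], [0.0, 0.0]], dtype=complex)  # photon is lost
    return [k0, k1]


def dual_rail_through_loss(alpha, beta, g):
    """Send alpha|0,1> + beta|1,0> through equal loss (transmission g**2) on both modes.

    Basis ordering |n1, n2> -> index 2*n1 + n2. Returns (rho_in, rho_out).
    """
    psi = np.zeros(4, dtype=complex)
    psi[1] = alpha  # |0,1>
    psi[2] = beta   # |1,0>
    rho_in = np.outer(psi, psi.conj())

    rho_out = np.zeros((4, 4), dtype=complex)
    for a in loss_kraus(g ** 2):
        for b in loss_kraus(g ** 2):
            k = np.kron(a, b)
            rho_out += k @ rho_in @ k.conj().T
    return rho_in, rho_out


def equation_one(rho_in, g):
    """Right-hand side of equation (1): g^2 rho_in + (1 - g^2)|0,0><0,0|."""
    vac = np.zeros((4, 4), dtype=complex)
    vac[0, 0] = 1.0
    return g ** 2 * rho_in + (1.0 - g ** 2) * vac
```

For any normalized amplitudes, `dual_rail_through_loss` and `equation_one` should agree, which reflects the error-detecting property described in the text: either the photon is lost and the state collapses to the two-mode vacuum, or the qubit is left intact.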

To demonstrate successful qubit quantum teleportation, we prepare 
four distinct qubit states: |0, 1⟩, |1, 0⟩, |ψ₁⟩ = (|0, 1⟩ − i|1, 0⟩)/√2 and
|ψ₂⟩ = (2|0, 1⟩ − |1, 0⟩)/√5. This set, including even and uneven superpositions
of |0, 1⟩ and |1, 0⟩ with both real and imaginary phases, represents
a fair sample of qubit states on the Bloch sphere. In theory, our teleporter 
acts on any qubit state in the same way (Supplementary Discussion). 

The experimental density matrix of the input state, |ψ₁⟩, is shown in
Fig. 2a. This input state is not a pure qubit state, but rather a mixed state
with a 25 ± 1% vacuum contribution, a 69 ± 1% qubit contribution and
a 6 ± 1% multiphoton contribution. Because the continuous-variable
teleporter transfers input states of arbitrary dimension, all of these com- 
ponents are teleported and constitute the final, mixed output state. We 
note that in our first analysis we do not discard any of these contribu- 
tions from the input or the output states, thus ensuring that none of the 
quantum states that enter or leave our teleporter is pre-selected or post- 
selected, respectively. 

First we present the output state of unit-gain teleportation with 
r = 1.01 ± 0.03 (Fig. 2b). All the matrix elements obtained are in good
agreement with theory: the qubit contribution decreases, whereas the
contribution of the multiphoton terms grows owing to the finite squeezing.
The off-diagonal elements of the qubit (|0, 1⟩⟨1, 0|, |1, 0⟩⟨0, 1|) retain
the original phase information of the input superposition between
|0, 1⟩ and |1, 0⟩, demonstrating that the non-classical feature of the
qubits is preserved during the teleportation process. These off-diagonal 
elements, however, decay a little more rapidly than do the diagonal 
characterized by single or dual homodyne measurement to verify the success of
teleportation. See Methods Summary for details. APD, avalanche photodiode; 
EOM, electro-optic modulator; HD, homodyne detector; LO-x and LO-p, local 
oscillators to measure x and p quadratures, respectively. 


elements (|0, 1⟩⟨0, 1|, |1, 0⟩⟨1, 0|), illustrating that the quantum
superposition of the qubit is the feature most susceptible to error in an
experimental situation.

Next we turned down the gain, g, and observed the new output state. 
Figure 2c shows the output state at g = 0.79 (close to g_opt = 0.77).
Compared with Fig. 2b, Fig. 2c shows that the qubit components
are almost undisturbed, but that the vacuum grows and the occurrence
of extra multiphoton components is suppressed. Thus, here the input-
output relation is similar to the pure-attenuation model with a loss
fraction of 1 − g_opt² = 0.41. The bar graph in Fig. 3 shows the g dependence
of the qubit and multiphoton components in the output state,
clearly demonstrating that gain tuning reduces the creation of addi- 
tional photons in continuous-variable teleportation. 
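
The quoted optimal gains and loss fraction follow directly from g_opt = tanh(r). The short sketch below, with hypothetical function names, evaluates them for the two squeezing parameters reported in the text and also checks that the photon-number weights of the two-mode squeezed state sum to one.

```python
import math


def optimal_gain(r):
    """Optimal feedforward gain g_opt = tanh(r) for squeezing parameter r."""
    return math.tanh(r)


def loss_fraction(r):
    """Intensity fraction 1 - g_opt^2 lost in the equivalent attenuation channel."""
    return 1.0 - math.tanh(r) ** 2


def squeezed_state_weights(r, n_max=200):
    """Probabilities (1 - g^2) g^(2n) of the |n, n> terms, truncated at n_max."""
    g = optimal_gain(r)
    return [(1.0 - g ** 2) * g ** (2 * n) for n in range(n_max + 1)]


if __name__ == "__main__":
    for r in (1.01, 0.71):
        print(f"r = {r}: g_opt = {optimal_gain(r):.3f}, loss = {loss_fraction(r):.3f}")
```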

The performance of the teleporter can be assessed by means of the fidelity.
In our deterministic scheme, we must take into account the vacuum
and multiphoton contributions, which was not the case in previous non-
deterministic teleportation experiments using post-selection. The overall
fidelity between the input state, ρ_in, and the output state, ρ_out, is


$$F_{\rm state} = \left[ {\rm Tr} \sqrt{ \sqrt{\rho_{\rm in}}\, \rho_{\rm out}\, \sqrt{\rho_{\rm in}} } \right]^2$$


When ρ_in has a qubit fraction of η, the classical bound on F_state is
F_thr = 1 − η/3, which is the best fidelity achievable without entanglement
(Supplementary Discussion). Therefore, F_state > F_thr is a success
criterion for unconditional quantum teleportation. Alternatively, we
may also assess our teleporter by calculating the fidelity after post-
selecting the qubit components alone: F_qubit = ⟨ψ|ρ_qubit|ψ⟩, where |ψ⟩
is the ideal qubit state and ρ_qubit is obtained by extracting from the
output density matrix the qubit subspace spanned by {|0, 1⟩, |1, 0⟩} and
then renormalizing it. We note that F_qubit > 2/3 is the success criterion
of post-selective teleportation with a pure input qubit and a mixed output
qubit.
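
The two figures of merit can be illustrated with a small numerical sketch. It assumes the squared-trace form of the mixed-state fidelity, a two-mode Fock basis truncated to one photon per mode with ordering |n1, n2⟩ → index 2·n1 + n2, and hypothetical function names; it does not reproduce the maximum-likelihood reconstruction used in the experiment.

```python
import numpy as np


def psd_sqrt(m):
    """Matrix square root of a Hermitian positive semidefinite matrix."""
    w, v = np.linalg.eigh(m)
    w = np.clip(w, 0.0, None)  # remove tiny negative eigenvalues from round-off
    return (v * np.sqrt(w)) @ v.conj().T


def state_fidelity(rho_in, rho_out):
    """F_state = [Tr sqrt( sqrt(rho_in) rho_out sqrt(rho_in) )]^2."""
    s = psd_sqrt(rho_in)
    inner = s @ rho_out @ s
    inner = 0.5 * (inner + inner.conj().T)  # enforce Hermiticity numerically
    return float(np.real(np.trace(psd_sqrt(inner))) ** 2)


def classical_bound(eta):
    """Classical limit F_thr = 1 - eta/3 for an input with qubit fraction eta."""
    return 1.0 - eta / 3.0


def qubit_fidelity(psi_qubit, rho_out, idx=(1, 2)):
    """Post-selected fidelity: project onto span{|0,1>, |1,0>} and renormalize.

    psi_qubit holds the amplitudes of (|0,1>, |1,0>).
    """
    sub = rho_out[np.ix_(idx, idx)]
    sub = sub / np.trace(sub)
    return float(np.real(psi_qubit.conj() @ sub @ psi_qubit))
```

With the measured qubit fraction of 0.69, `classical_bound(0.69)` gives about 0.77, consistent with the threshold quoted in the text.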

As shown in Fig. 3, the g dependence of these two fidelities is 
in good agreement with the theoretical predictions. The maximal 
fidelities are obtained at g = 0.79. Most importantly, here, we
satisfy not only the usual qubit-subspace teleportation criterion,
F_qubit = 0.875 ± 0.015 > 2/3, but also the fully non-post-selected,
Fock-space criterion, F_state = 0.817 ± 0.012 > F_thr = 0.769 ± 0.004, thus
demonstrating deterministic, unconditional quantum teleportation of
a photonic qubit. As well as for the input qubit, |ψ₁⟩, the Fock-space
criterion is also fulfilled for the other three qubit states, |0, 1⟩, |1, 0⟩ and
|ψ₂⟩, with the same experimental r and g values, for which states F_state
values of 0.800 ± 0.006, 0.789 ± 0.006 and 0.796 ± 0.011 were respectively
observed (theoretically, F_state and F_qubit are independent of the
Figure 2 | Experimental density matrices. By means of homodyne
tomography, two-mode density matrices are reconstructed both for
the input and the output states in photon-number bases:
$\rho = \sum_{k,l,m,n=0}^{\infty} \rho_{klmn}\,|k,l\rangle\langle m,n|$. The bars show the real or imaginary parts


qubit; see Supplementary Discussion and Supplementary Data). We 
note that, although the pure-attenuation model predicts F_qubit = 1 and
a complete suppression of multiphoton terms at gain g_opt, our results
slightly deviate from that situation owing to experimental imperfec- 
tions, such as extra loss and phase fluctuations of the squeezing. 
Finally, Fig. 2d shows the output state for the lower squeezing level, 
r = 0.71 ± 0.02, and g = 0.63 (g_opt = 0.61). Here, although the vacuum
component becomes more dominant, the qubit components retain
almost the same form as in Fig. 2c. Under these circumstances, the success
of teleportation is only post-selective (F_qubit = 0.879 ± 0.015 > 2/3,
F_state < F_thr), owing to the insufficient squeezing resource. However, the
overall success probability for transferring the qubits (43 ± 3%, the ratio of
Figure 3 | Experimental results of teleportation including gain tuning. The
horizontal axis, showing g, uses a logarithmic scale. Orange and green bars
respectively represent qubit and multiphoton components of the teleported states
(the left-hand vertical axis). Red diamonds and blue circles with error bars (1 s.d.)
correspond to F_qubit and F_state, respectively (the right-hand vertical axis). Theoretical
fidelity curves (Supplementary Information) are also plotted, in the same colours.
All observed F_qubit values significantly exceed the classical limit of 2/3. For g = 0.79,
F_state > 1 − η/3 and, thus, unconditional teleportation is demonstrated.
of each matrix element ρ_klmn. Blue, red and green bars correspond to the
vacuum, qubit and multiphoton components, respectively. a, Input state, |ψ₁⟩.
b-d, Output states for different values of r and g.


the input and output qubit components) is still much higher than in 
previous experiments (<1%), even for this relatively low squeezing level. 
This shows the great advantage of our hybrid approach over the standard 
approaches. 

In conclusion, we experimentally realized unconditional quantum 
teleportation of four distinct photonic qubit states, exceeding the fide- 
lity limits of classical teleportation in a deterministic fashion. In our 
scheme, once the input qubit states are prepared, there is no need to 
preprocess or post-select them, and the teleported states freely emerge 
at the output of our teleporter. 


METHODS SUMMARY 


Our experimental set-up is shown in Fig. 1. The time-bin qubit is conditionally 
created at a rate of ~5,000 s⁻¹ using a continuous-wave laser (wavelength,
860 nm), by extending the technique of ref. 30. Each time bin has a frequency 
bandwidth of 6.2 MHz around the laser frequency. Our continuous-variable 
teleporter’ operates continuously with a bandwidth of 12 MHz around the laser 
frequency, which is sufficiently wide to cover the qubit bandwidth—ultimately 
enabling us to teleport qubits in a deterministic fashion. In our teleporter, two 
single-mode squeezed states (each with an ideal, pure squeezing parameter r) from 
two optical parametric oscillators are suitably mixed at a 50:50 beam splitter to 
generate the quadrature-entangled beams. This entanglement source is perma- 
nently available with no need for any probabilistic heralding mechanism. At the 
sending station of the teleporter, the input qubit is first combined with one half of 
the entangled beams at a 50:50 beam splitter. A complete continuous-variable 
BSM is then performed by measuring the two output modes of the beam splitter 
through two homodyne detections of two orthogonal quadratures. These homo- 
dyne signals are classically communicated to the receiving station, where they are 
multiplied by a gain factor (g) and fed forwards by means of a displacement opera- 
tion on the other half of the entangled beams. Time synchronization of this final 
displacement is achieved by introducing an optical delay to the corresponding 
entangled beam. Finally, the output state is characterized by single or dual homodyne 
measurement”. For every state, 100,000 sets of quadrature values are recorded and 
the corresponding two-mode density matrix is reconstructed using the maximum- 
likelihood technique without compensating finite measurement efficiencies. 
Deterministic quantum teleportation with
feed-forward in a solid state system

Engineered macroscopic quantum systems based on superconduct-
ing electronic circuits are attractive for experimentally exploring 
diverse questions in quantum information science’ ’. At the current 
state of the art, quantum bits (qubits) are fabricated, initialized, 
controlled, read out and coupled to each other in simple circuits. 
This enables the realization of basic logic gates*, the creation of 
complex entangled states*® and the demonstration of algorithms’ 
or error correction®. Using different variants of low-noise parametric 
amplifiers’, dispersive quantum non-demolition single-shot readout 
of single-qubit states with high fidelity has enabled continuous’’ and 
discrete" feedback control of single qubits. Here we realize full deter- 
ministic quantum teleportation with feed-forward in a chip-based 
superconducting circuit architecture'?*. We use a set of two para- 
metric amplifiers for both joint two-qubit and individual qubit sin- 
gle-shot readout, combined with flexible real-time digital electronics. 
Our device uses a crossed quantum bus technology that allows us to 
create complex networks with arbitrary connecting topology in a pla- 
nar architecture. The deterministic teleportation process succeeds 
with order unit probability for any input state, as we prepare maxi- 
mally entangled two-qubit states as a resource and distinguish all Bell 
states in a single two-qubit measurement with high efficiency and 
high fidelity. We teleport quantum states between two macroscopic 
systems separated by 6 mm at a rate of 10⁴ s⁻¹, exceeding other reported
implementations. The low transmission loss of superconducting
waveguides is likely to enable the range of this and other schemes to be 
extended to significantly larger distances, enabling tests of non-loc- 
ality and the realization of elements for quantum communication at 
microwave frequencies. The demonstrated feed-forward may also 
find application in error correction schemes. 

Transferring the state of an information carrier from a sender to a 
receiver is an essential primitive in both classical and quantum communi- 
cation and information processing. In a quantum mechanical context, 
teleportation describes the process of transferring an unknown quantum 
state between two parties at two different physical locations without 
transferring the physical carrier of information itself. Instead, teleporta- 
tion makes use of the non-local correlations provided by an entangled 
pair shared between the sender and the receiver and the exchange of 
classical information’. This concept is central to extending the range 
of quantum communication using quantum repeaters’® and can also 
be used to implement logic gates for universal quantum computation”. 

In the original teleportation protocol, the unknown state |ψ_in⟩ of
qubit Q1 in possession of the sender is transferred to the receiver's
qubit, Q3 (Fig. 1a). To enable this task, sender and receiver prepare in
advance a maximally entangled (Bell) state between an ancillary qubit
Q2, which remains with the sender, and Q3, which is located at the
receiver. Then the sender performs a measurement of Q1 and Q2 in the
Bell basis, which projects the two qubits in the sender’s possession onto
one of the four possible Bell states |Φ±⟩ = (|00⟩ ± |11⟩)/√2 and
|Ψ±⟩ = (|01⟩ ± |10⟩)/√2. As a consequence the receiver’s qubit Q3
is projected instantaneously and without ever having interacted with
the sender’s qubit Q1 onto a state |ψ_out⟩ = {𝟙, X, Z, Y}|ψ_in⟩ (where
X = σ_x, Z = σ_z and Y = iσ_y are the Pauli matrices), which differs from
the input state |ψ_in⟩ only by a single-qubit rotation, depending on the
four possible measurement results. To always recover the original state
|ψ_in⟩ the receiver has to rotate the output state of Q3 conditioned on
the outcome of the Bell measurement communicated to the receiver as 
two bits of information by means of a classical channel. This final step 
is frequently referred to as feed-forward, because the outcome of a 
measurement performed on one part of a system is used to control 
another part of the same quantum system. This is in contrast to acting 
back on the same quantum system in a feedback process. 
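
The protocol described above can be sketched as a small state-vector simulation. This is a textbook version using a CNOT gate and a Hadamard gate for the Bell measurement, not the controlled-PHASE implementation of the experiment; the qubit ordering (Q1, Q2, Q3), the function name and the use of NumPy's random generator are assumptions made for illustration.

```python
import numpy as np

H = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)
# CNOT with the first qubit as control and the second as target
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)


def teleport(psi_in, rng):
    """Teleport the single-qubit state psi_in from Q1 to Q3.

    Returns the two measured bits (m1, m2) and the state of Q3 after feed-forward.
    """
    bell = np.array([1, 0, 0, 1], dtype=complex) / np.sqrt(2)  # |Phi+> on Q2, Q3
    state = np.kron(psi_in, bell)                              # ordering Q1, Q2, Q3

    # Bell-state measurement on Q1, Q2: CNOT, Hadamard, then computational-basis readout
    state = np.kron(CNOT, I2) @ state
    state = np.kron(H, np.eye(4, dtype=complex)) @ state
    amps = state.reshape(2, 2, 2)                              # indices (q1, q2, q3)
    probs = np.sum(np.abs(amps) ** 2, axis=2).ravel()
    outcome = rng.choice(4, p=probs / probs.sum())
    m1, m2 = divmod(int(outcome), 2)
    q3 = amps[m1, m2, :] / np.sqrt(probs[outcome])

    # Feed-forward: rotate Q3 conditioned on the two classical bits
    if m2:
        q3 = X @ q3
    if m1:
        q3 = Z @ q3
    return (m1, m2), q3
```

Without the two conditional corrections, Q3 would match the input only for the outcome (0, 0), which corresponds to the post-selected case; with them, every one of the four outcomes returns the input state.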

The success of the teleportation protocol in every instance with unit 
fidelity is counterintuitive from a classical point of view. The receiver’s 
qubit does not interact with any other qubit after |ψ_in⟩ is prepared. The
classical information sent by the sender is not sufficient to recreate
|ψ_in⟩ perfectly at the receiver. Indeed, assuming no entanglement
between sender and receiver, one can replicate the sender’s state at
best with a fidelity of 2/3 (ref. 18) because only a fraction of information
about |ψ_in⟩ is obtained by a single projective measurement.
Figure 1 | Circuit diagram of quantum teleportation. a, The original
protocol starts with the preparation of a Bell state between Q2 and Q3 (blue
box), followed by the preparation of an arbitrary state |ψ_in⟩ (green box) and a
Bell-state measurement of Q1 and Q2 (red box). The classical information
extracted by the measurement of Q1 and Q2 is transferred to the receiver to
perform local gates conditioned on the measurement outcomes. After the
protocol, Q3 is in state |ψ_out⟩, which ideally is identical to |ψ_in⟩ (also coloured
green). Here, H is the Hadamard gate, and X and Z are Pauli matrices. The CNOT
gate is represented by a vertical line between the control qubit (•) and the target
qubit (⊕). b, The protocol implemented in our experiment uses controlled-
PHASE gates indicated by vertical lines between the relevant qubits (•), and
single-qubit rotations of angle θ about the ±y axis. To finalize the
teleportation we either post-select on any single one of the four measurement
outcomes (00, 01, 10 and 11) acquired in a single shot, or we deterministically
use all four outcomes, which we then may use to implement feed-forward. The
feed-forward rotations are applied to Q3 conditioned on the four
measurement outcomes according to the table presented in the box framed by a
black line.
In pioneering work the teleportation protocol was first implemented
with single photons over laboratory-scale distances and later also over
kilometre-scale distances in free space. However, in these experiments
only two out of four Bell states were distinguished unambiguously, limiting
the efficiency of the protocol to 50% at best. A proof-of-principle
experiment that can distinguish all four Bell states was implemented
using nonlinear photon interaction, but the efficiency of the detection
step was much below 1%. With photonic continuous-variable states,
teleportation has been achieved deterministically for all measurement
outcomes and the final conditional rotation has been implemented to
complete the teleportation protocol. In atomic qubits, fully deterministic
quantum teleportation has been realized over micrometre-scale
distances with ions in the same trap. Non-deterministically the protocol
has also been implemented between ions in different traps and in
atomic ensembles.

The experimental realization of teleportation faces several import- 
ant challenges that have been sequentially addressed over many years 
of research in most physical realizations. The ideal protocol requires 
the creation of an entangled pair shared between the sender and 
the receiver, a two-qubit measurement identifying all four Bell states 
at the sender, and feed-forward of the classical information to perform 
the final qubit state rotation to complete the protocol, all at high effi- 
ciency and fidelity. In addition it is frequently desired to run the pro- 
tocol at a high rate and over large distances to maximize its usefulness. 

Here we demonstrate in a solid state system post-selected teleporta- 
tion, deterministic teleportation and deterministic teleportation with 
feed-forward in a single experimental setting. Our experiment addresses 
all of the above criteria with the exception of realizing space-like sepa- 
ration between the sender and the receiver. The coherent part of the 
teleportation protocol presented here has previously been characterized 


Figure 2 | Diagrams of sample and measurement setup. a, Chip design 
including three resonators R1, R2 and R3 (black) with corresponding input and 
output lines (red) used for readout and coupling of three transmon qubits Q1, 
Q2 and Q3 (orange). The fourth qubit in the lower right corner of the chip is not 
used. The local microwave charge gate lines (green) are used for single-qubit 
rotations; the local flux-bias lines (blue) permit nanosecond time control of the 
qubit frequencies to implement two-qubit operations. Below: false-colour 
micrographs of Q1 (left) coupled to resonator R1, and Q2 (right) coupled to 
both resonators R1 and R2. The microfabricated aluminium air bridges visible 
as bright white strips realize crossovers for the resonator lines. Air bridges are 
also used to suppress spurious electromagnetic modes by connecting the 
ground planes across the coplanar wave guides. b, Simplified diagram of the 
measurement setup with the same colour code as in a. c, Amplitude-controlled 
and phase-controlled microwave pulses are applied to the qubits using 
in our laboratory by performing full-state tomography of three qubits 
interacting with a single mode field”. That work has allowed us to test 
the protocol up to, but not including, the essential single-shot readout 
and feed-forward steps, which are the focus of this study. 

In the realization of teleportation presented here, we use three super- 
conducting transmon qubits* (Q1, Q2 and Q3) coupled to three super- 
conducting coplanar waveguide resonators (R1, R2 and R3) in a circuit 
quantum electrodynamics setup’* (Fig. 2). At the sender, qubits Q1 and 
Q2 are coupled capacitively to resonator R1; at the receiver, Q3 is coupled 
to R3. The individual resonators R1 and R3 allow the sender and the 
receiver to perform independent measurements of their qubits, as detailed 
below. In addition, Q2 and Q3 are coupled to R2. The resonators R2 and 
R1 act as quantum buses to realize two-qubit controlled-PHASE (CPHASE) 
gates’ used to create a shared Bell state distributed between the sender and 
the receiver and to perform a deterministic Bell state analysis at the 
sender. We perform single-qubit rotations with an average fidelity of 
greater than 98% by applying amplitude-controlled and phase-controlled 
microwave pulses through individual charge gate lines. The transition 
frequency of each qubit is controlled by individual flux bias lines. 

We start the protocol by heralding the three-qubit ground state 
based on a single-shot measurement of all qubits. Using single-qubit 
rotations and a CPHASE gate with process fidelity 90%, a Bell state 
with state fidelity 92% is created on demand between qubits Q2 
and Q3 that is shared between the sender and the receiver (blue box 
in Fig. 1b). Only after the creation of the shared entanglement do we use 
a single-qubit rotation to prepare the state of qubit Q1 to be teleported. 

As an essential part of the protocol, we perform a Bell measure- 
ment of qubits Q1 and Q2 at the sender by mapping the Bell basis 
{|Φ⁻⟩, |Ψ⁻⟩, |Φ⁺⟩, |Ψ⁺⟩} onto the computational basis {|00⟩, |01⟩, 
sideband modulation of an up-conversion in-phase quadrature (IQ) mixer 
driven by a local oscillator (LO) and modulated by an arbitrary waveform 
generator (AWG). d, The measurement signals transmitted through R1 and R3 
are amplified using Josephson parametric amplifiers (JPA) pumped by an LO 
through a directional coupler (D), which is also used for phase (φ) and 
amplitude (A) controlled cancellation of the pump leakage, and coupled to and 
isolated from the sample by two circulators (C). e, The signal is further 
amplified by high-electron-mobility transistor (HEMT) amplifiers at 4 K and 
chains of ultra-low-noise (ULN) and low-noise (LN) room-temperature (RT) 
amplifiers. f, The transmission signal of both resonators is down-converted to 
an intermediate frequency (IF) in an IQ mixer pumped by a dedicated LO, 
digitized, and fed into field-programmable gate array (FPGA) logic for real- 
time data analysis and triggering the conditioned feed-forward step. 
|10⟩, |11⟩} (Fig. 1a). We realize this basis transformation by using single-qubit 
rotations and a CPHASE gate. Then we perform a projective joint readout of 
the states of Q1 and Q2 by measuring the transmission amplitude and 
phase of resonator R1. A given Bell state is transformed to the 
corresponding computational basis state, resulting in the corresponding 
output state |ψ_out⟩ = {1, X, Z, Y}|ψ_in⟩ of Q3. 

To perform a post-selected teleportation protocol, it is sufficient to 
distinguish only one of the four Bell states, say |00⟩, ideally occurring 
with a probability of 1/4, with high fidelity from all other states 
(|01⟩, |10⟩ and |11⟩) by using the aforementioned scheme. This is 
possible in our setup by performing a dispersive single-shot qubit readout 
with a fidelity of (91.0 ± 0.2)%. For this purpose we amplify a 
measurement tone applied to R1 with a Josephson parametric amplifier 
operated in the phase-sensitive mode, in which no or only very little 
noise is added to the signal. If the measurement of Q1 and Q2 returns 
00, qubit Q3 is instantaneously projected to the desired state |ψ_in⟩, not 
requiring any additional rotations at the receiver's qubit to complete 
teleportation. 

With a second, independent, parametric amplifier, also operated in 
the phase-sensitive mode, a measurement tone transmitted through 
resonator R3 is used to read out the state of qubit Q3 with a single-shot 
fidelity of (89.1 ± 0.5)%. In post-selection (ps), state tomography of 
Q3 conditioned on a 00 measurement of Q1 and Q2 reveals the original 
input state with an average fidelity of F_s^ps = (81.9 ± 1.3)% 
(Extended Data Fig. 3a). By characterizing |ψ_out⟩ for four linearly 
independent input states |ψ_in⟩, we perform full process tomography of the 
state transfer by teleportation from Q1 to Q3 to reconstruct the process 
matrix χ_00, which has a fidelity of F_p^ps = (72.5 ± 1.3)% with respect 
to the expected identity operation. The process fidelity F_p is related to 
the average output-state fidelity F_s by F_p = (F_s(d + 1) − 1)/d, where 
d is the dimensionality of the input state and output state. 
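
The relation between the two fidelities can be checked numerically. The following is a minimal sketch, assuming a single-qubit channel (d = 2); the function name `process_fidelity` is illustrative and not taken from the paper.

```python
def process_fidelity(f_state: float, d: int = 2) -> float:
    """Convert an average output-state fidelity F_s into a process fidelity F_p.

    Implements F_p = (F_s * (d + 1) - 1) / d, where d is the dimension
    of the input and output state space (d = 2 for one qubit).
    """
    return (f_state * (d + 1) - 1) / d


# The classical state-fidelity limit of 2/3 maps to a process fidelity of 1/2.
classical_limit = process_fidelity(2 / 3)

# Post-selected state fidelity quoted in the text (81.9 %).
post_selected = process_fidelity(0.819)
```

For F_s = 81.9% the formula gives about 72.9%, which agrees with the quoted process fidelity of (72.5 ± 1.3)% within its stated uncertainty.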


[Figure 3 panel labels: |00⟩ = |Φ⁻⟩, |01⟩ = |Ψ⁻⟩] 


Figure 3 | State-transfer process matrix for quantum teleportation. The 
experimentally obtained (solid bars) absolute values of the process matrices |χ| 
describing the state transfer from Q1 to Q3 by teleportation are displayed for 
post-selection on any single one of the Bell-measurement outcomes 00, 01, 10 
It is important to note that in post-selection we are able to map any 
of the four Bell states to the computational basis state |00⟩ on demand 
by applying π-pulses to Q1 and/or Q2 right before their joint readout. 
This allows us to post-select individually on any of the four Bell states and 
to determine the corresponding process matrices χ_{00,01,10,11}. The 
experimentally obtained process matrices (Fig. 3a) agree well with the expected 
processes. The average output-state fidelity F_s^ps = (81.7 ± 1.4)% of all 
four processes is clearly above the classical limit of 2/3 (ref. 18). This 
results in an average process fidelity when post-selecting on a single Bell 
state of F_p^ps = (72.0 ± 1.4)%, well above the classical limit of 1/2. The 
output-state fidelity is predominantly limited by the relaxation and 
dephasing of our qubits, which affects both the effective gate and readout 
fidelity (see Methods). 

To identify uniquely and simultaneously the four randomly distrib- 
uted outcomes of the Bell-state measurement at the sender in a deter- 
ministic way (instead of using post-selection) we use high-fidelity 
dispersive single-shot qubit readout enabled by the Josephson para- 
metric amplifier operated in the phase-preserving mode”’. In this 
mode we achieve a probability of (81.8 ± 0.5)% to correctly identify 
all four pairs of basis states, by amplifying and recording both 
quadrature amplitudes (I, Q) of the detected electromagnetic field at the cost 
of adding at least the vacuum noise to the input signal. Correlating the 
four single-shot Bell-state measurement outcomes at the sender with the 
single-shot measurement outcomes at the receiver, and performing state 
and process tomography, we find an average deterministic output-state 
fidelity of the transferred state of F_s^det = (77.1 ± 1.2)% and an average 
process fidelity of F_p^det = (65.5 ± 1.1)%, well above the classical limits of 
2/3 and 1/2, respectively. The process matrices (Fig. 3b) prominently 
show the characteristic features of the expected processes. The fidelities 
obtained with this method are lower than for the post-selected teleporta- 
tion, because of the lower fidelity of the deterministic Bell-state readout. 


[Figure 3 panel labels: |11⟩ = |Ψ⁺⟩, |10⟩ = |Φ⁺⟩] 


and 11 (a), simultaneous deterministic measurement of all four outcomes 
(b), and with feed-forward (c). The respective process fidelities are indicated in 
black boxes. The ideal |χ| is indicated by wire frames. 
To complete the teleportation protocol we have analysed the 
deterministic Bell-state measurement in real time by using fast electronics 
based on a field-programmable gate array (FPGA). Digitally defining 
two thresholds for the single-shot measurements of the quadrature 
amplitudes I and Q performed at the sender, we realize the feed-forward 
(ff) step in about 500 ns, in which the rotations 1, X, YX (equal to Z up to 
a global phase) and Y are applied conditioned on the four Bell-state 
measurement outcomes 00, 01, 10 and 11. We analysed more than 270,000 
single-shot experiments and find a deterministic state-transfer fidelity of 
F_s^ff = (68.8 ± 0.5)% with a process fidelity F_p^ff = (53.2 ± 0.5)%, close to 
but clearly above the classical thresholds (Fig. 3c). The limited fidelity can 
be traced back to the fidelities of single-qubit and two-qubit operations, the 
readout fidelities and the time required for the feed-forward in relation to the 
coherence times of the qubits used. 

When averaging and appropriately normalizing the measurement 
results of Q3 directly in the FPGA, we are able to circumvent the 
limited single-shot readout fidelity of Q3. In this case, the 
deterministically teleported states including the feed-forward step have an 
average fidelity of F_s = (77.4 ± 0.2)%, with a corresponding process 
fidelity of F_p = (62.2 ± 0.3)%. 

To characterize the feed-forward process only, we have prepared 
separable input states (|00⟩, |01⟩, |10⟩ and |11⟩) of Q1 and Q2, and 
arbitrary states |ψ⟩ of Q3. The input states measured and digitally 
analysed at the sender are then used to actuate the feed-forward. 
The fidelity of the resulting state of Q3 subject to feed-forward was 
determined to be F_s = (79.5 ± 1.5)% with a corresponding process 
fidelity of F_p = (72.4 ± 1.5)%. These results also indicate that this type 
of deterministic feed-forward can in principle be used for demonstrating 
error correction in future experiments, if high enough fidelities can 
be realized. 

These results, in conjunction with continuing improvements in 
coherence time’, advances in circuit design and operation, and the 
potential for using propagating microwave photons, form a solid 
ground for future progress in quantum information processing and 
quantum communication with superconducting circuits. 


METHODS SUMMARY 


The teleportation protocol (Fig. 1b) is implemented using the pulse scheme 
depicted in Extended Data Fig. 1. Single-qubit rotations are realized by 12 ns long 
resonant microwave pulses. Controlled-PHASE gates are implemented by shifting 
one of the two qubits’ transition frequencies to the avoided level crossing between 
the |11) and |02) states. For single-shot measurements, the output signals of reso- 
nators R1 and R3 are amplified using Josephson parametric amplifiers. For the 
experiments in which we post-select on an individual Bell state, we operate the 
parametric amplifier in the phase-sensitive mode. This allows us to distinguish 
the state |00) from all other states by analysing the integrated measurement signal 
(Extended Data Fig. 2a). In the experiments in which we simultaneously distin- 
guish between all four Bell states, the parametric amplifier is used in the phase- 
preserving mode, in which both quadratures of the transmitted field are amplified. 
By adjusting the pump and readout powers we maximize the distinguishability of 
all four states by their respective quadrature amplitudes in the complex plane 
(Extended Data Fig. 2b). To implement the feed-forward, all measurement data 
are analysed in real-time in an FPGA with custom firmware. Depending on the 
measurement outcome, the FPGA triggers up to two arbitrary waveform genera- 
tors, which realize X and Y rotations. We characterize the state transfer from Q1 to 
Q3 with full process tomography by performing state tomography on the output 
state for four different linearly independent input states (Extended Data Fig. 3). 
We also use the teleportation protocol to verify the entanglement between Q2 and 
Q3 using six mutually unbiased input basis states. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 


Sample parameters. The sample consists of three superconducting coplanar 
waveguide resonators and three qubits of the transmon type, as depicted in Fig. 2. The 
resonators R1 and R3 have bare resonance frequencies ν_r = {7.657, 9.677} GHz, 
respectively. They are coupled by gap and finger capacitors to their input and 
output lines. The overcoupled resonator decay rates are measured to be κ/2π = 
{2.4, 2.5} MHz. The coupling capacitances are designed asymmetrically such that 
the decay rate through the input port is approximately 100-fold lower than through 
the output port. The resonator R2 is not coupled to any input or output line. Its 
resonance frequency is about 8.7 GHz and its decay rate is expected to be close to the 
internal decay rate. From spectroscopic measurements we determine the 
maximum transition frequencies ν_max = {6.273, 7.373, 8.390} GHz and charging 
energies E_C/h = {0.297, 0.303, 0.287} GHz of the qubits Q1, Q2 and Q3, respectively, 
where h is Planck's constant. 

Qubits Q1 and Q2 are coupled to resonator R1 with coupling strengths 
g/2π = {0.260, 0.180} GHz, and Q3 is coupled to resonator R3 with a coupling 
strength of g/2π = 0.240 GHz. The coupling of Q2 and Q3 to R2 is estimated from 
the transverse coupling strength (see below) to be g/2π = 0.2 GHz each. 

For the presented experiments, the qubits were tuned to idle state transition 
frequencies ν = {4.657, 5.390, 6.342} GHz with miniature superconducting coils 
mounted underneath the chip. At these frequencies we have determined their 
energy relaxation T_1 = {5.1, 3.1, 2.6} µs and coherence times T_2 = {1.0, 1.1, 1.7} µs. 
Pulse scheme. All biased qubits and resonators are separated in frequency from 
each other by at least 750 MHz to suppress cross-talk. 

The protocol (Fig. 1b) is implemented with the pulse scheme depicted in Extended 
Data Fig. 1. Single-qubit rotations are implemented by resonant Gaussian-shaped 
DRAG microwave pulses 12 ns long with an average fidelity of more than 98% as 
determined by randomized benchmarking. The controlled-PHASE gate is 
implemented by shifting the qubits with fast magnetic flux pulses to the avoided level 
crossing between the |11⟩ and |02⟩ states of the involved qubits. The transverse 
coupling strengths of J/2π = 12.2 MHz (between qubits Q1 and Q2) and 
J/2π = 10.4 MHz (between Q2 and Q3) lead to pulse lengths for the CPHASE 
gates of t = {40.9, 47.9} ns, respectively. 
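
As a rough consistency check, the quoted pulse lengths can be compared with a simple estimate. The sketch below assumes that the conditional phase of π is acquired after one full oscillation at the avoided crossing, so that t = π/J = 1/(2 · J/2π). This relation is an assumption made here for illustration and is not stated in the text; the function name is hypothetical.

```python
def cphase_gate_time_ns(j_over_2pi_mhz: float) -> float:
    """Estimated gate time in ns, assuming t = pi / J with J/2pi given in MHz."""
    # 1 / (2 * f) with f in MHz gives microseconds; multiply by 1e3 for ns.
    return 1e3 / (2.0 * j_over_2pi_mhz)


# Transverse coupling strengths quoted in the text (Q1-Q2 and Q2-Q3).
gate_times = [cphase_gate_time_ns(j) for j in (12.2, 10.4)]
```

This gives roughly 41.0 ns and 48.1 ns, close to the quoted 40.9 ns and 47.9 ns; the small differences presumably reflect rounding of the coupling strengths or details of the pulse calibration.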

The Bell measurement (Extended Data Fig. 1, red elements) allows us to map 
any of the four Bell states to the |00⟩ state by adding π-pulses to Q1 and Q2 to flip 
their states just before starting the measurement. Instead of applying these π-pulses 
directly, we change the phases of the preceding π/2-pulses accordingly, which can 
easily be verified to be equivalent. 
Qubit readout. To realize single-shot measurements, the output signals of 
resonators R1 and R3 are amplified by dedicated Josephson parametric amplifiers. 
The parametric amplifiers are similar to that used in ref. 30. They are realized as 
λ/4 coplanar waveguide resonators terminated by an array of 11 superconducting 
quantum interference devices (SQUIDs), which provide the necessary nonlinearity 
and make the operation frequencies tunable by miniature superconducting coils 
on the bottom of the sample holder. The maximum frequencies for the two 
parametric amplifiers are ν_max = {8.349, 10.141} GHz for R1 and R3, respectively. 
To provide a fast response, large input capacitors were fabricated that resulted in a 
measured Josephson parametric amplifier line width of κ/2π = {334, 548} MHz 
in the linear regime. For the experiments the parametric amplifiers were tuned to 
have a maximum gain of G = {20.4, 20.6} dB with a 3-dB bandwidth of B/2π = 
{16, 85} MHz at frequencies ν = {7.686, 9.737} GHz. 

For the experiments in which we post-selected on an individual Bell state, the 
transmission of R1 was measured at the readout frequency ν_ro = 7.686 GHz, which 
is the mean value of the effective resonator frequencies for the qubits Q1 and Q2 in 
the state |00⟩ and |01⟩. The parametric amplifier is used in the phase-sensitive 
mode by tuning its transition frequency such that the maximum gain was achieved 
at the readout frequency ν_ro, at which it was also pumped. Preparing the four 
computational basis states |00), |01), |10) and |11), applying a measurement tone 
to R1 and integrating the amplified transmission signal for 280 ns resulted in a 
distribution of the integrated measurement signals as shown in Extended Data 
Fig. 2a. We optimized for the readout contrast between the |00) and all other states. 
In this way, the mean values of the distributions of the integrated signals for the 
states |00), |10) (Extended Data Fig. 2a) have the largest separation. However, as a 
result of the finite qubit lifetime, some of the |01) and |10) states decay into the 
ground state and are visible in the data as such. We choose a threshold for the 
integrated quadrature values to discriminate 00 from all other measurement 
outcomes 01, 10 and 11 with a fidelity of (91.0 ± 0.2)%. 

In the experiments in which we are able to distinguish between all four Bell 
states simultaneously, the readout frequency ν_ro = 7.683 GHz is chosen to be the 
mean of the effective resonator frequencies for the qubits Q1 and Q2 in the state 
|01) and |10). The parametric amplifier is used in the phase-preserving mode by 
detuning the pump frequency by 6.25 MHz from the readout frequency. In this 
way the gain G = 18.4 dB at the readout frequency and the effective bandwidth are 
smaller than for phase-sensitive amplification, but both quadratures of the 
transmitted field are amplified. By preparing the computational basis states and 
recording the integrated transmitted signals of both quadratures (I, Q) simultaneously, 
we can map every measurement outcome to a point on the complex plane. By 
adjusting the pump power and the readout power we find settings that maximize 
the distinguishability of all four states by their location in the complex plane. 

Adjusting the phase of the local oscillator and implementing small linear offsets 
directly in the FPGA, we are able to choose the I = 0 and Q = 0 axes of the complex 
plane as thresholds to identify the four different output states in real time 
(Extended Data Fig. 2b). By assigning our best estimate of the corresponding states 
to every measurement outcome according to a quadrant in the I-Q plane, we 
identify (81.8 ± 0.5)% of the prepared states correctly. The probabilities of either 
correctly identifying a prepared state or misidentifying it as a different state are 
listed in Extended Data Table 1 for all four input basis states. 
Feed-forward. To implement the feed-forward, all measurement data are analysed 
in real-time in an FPGA with custom firmware. Depending on the measurement 
outcome, the FPGA triggers up to two different arbitrary waveform generators 
(AWGs), which realize X and Y rotations. To optimize the readout fidelity and 
minimize the time needed for data analysis, we calibrate the phase of the down- 
conversion local oscillator to rotate the readout data of Q1 and Q2 such that the 
four different states can be distinguished by thresholds parallel to the I and Q axes 
of the complex plane. An additional small offset is subtracted digitally such that the 
assignment of a measured signal to a given state is based solely on the quadrant of 
the complex plane it lies in (see Extended Data Fig. 2b). 
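
The quadrant-based decision described above can be sketched in a few lines. This is only an illustration of the logic, not the FPGA firmware: the assignment of quadrants to outcomes and the offset parameters are hypothetical, since the text does not specify them. The corrections per outcome (1, X, Z, Y for 00, 01, 10, 11) follow the main text.

```python
# Hypothetical quadrant-to-outcome assignment: the source does not state
# which quadrant of the I-Q plane corresponds to which two-qubit outcome.
QUADRANT_TO_OUTCOME = {
    (True, True): "00",    # I >= 0, Q >= 0
    (False, True): "01",   # I <  0, Q >= 0
    (False, False): "10",  # I <  0, Q <  0
    (True, False): "11",   # I >= 0, Q <  0
}

# Correction applied to Q3 for each Bell-measurement outcome, as listed in the text.
OUTCOME_TO_CORRECTION = {"00": "1", "01": "X", "10": "Z", "11": "Y"}


def classify(i: float, q: float, i_offset: float = 0.0, q_offset: float = 0.0) -> str:
    """Assign a two-qubit outcome from the signs of the offset-corrected quadratures."""
    return QUADRANT_TO_OUTCOME[(i - i_offset >= 0.0, q - q_offset >= 0.0)]


def feed_forward(i: float, q: float) -> str:
    """Return the label of the correction that would be triggered for one shot."""
    return OUTCOME_TO_CORRECTION[classify(i, q)]
```

Because the decision reduces to two sign comparisons, it needs no distance calculation or lookup over calibration data, which is consistent with the stated goal of minimizing analysis time.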

We integrate the signal for 160 ns to achieve the desired signal-to-noise ratio. 
This choice of integration time also digitally filters out the parametric amplifier 
pump tone, which is detuned by 6.25 MHz from the measurement tone. We 
experimentally determine a maximum readout fidelity by starting the integration 
time 140 ns after the initial rise of the pulsed measurement tone applied to the 
resonator. The FPGA used for data analysis and the AWG used for generating the 
feed-forward pulses introduce a combined delay of 140 ns; the total cable delay is 
65 ns. As a result, the conditional rotations are applied to Q3 505 ns after the 
measurement tone has been applied to the readout resonator R1 (Extended 
Data Fig. 1). To mitigate dephasing of Q3 during this feed-forward delay time, 
we apply a series of four dynamical decoupling pulses to Q3. 
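
The delay budget quoted above can be added up explicitly. The sketch below uses only the contributions named in the text; the comparison with the coherence time of Q3 assumes that the third coherence time listed under Sample parameters (1.7 µs) belongs to Q3, which is a reading made here for illustration.

```python
# Contributions to the feed-forward delay in ns, as quoted in the text.
FEED_FORWARD_DELAYS_NS = {
    "integration start after measurement-tone rise": 140,
    "signal integration": 160,
    "FPGA and AWG latency": 140,
    "cable delay": 65,
}

total_delay_ns = sum(FEED_FORWARD_DELAYS_NS.values())

# Assumed coherence time of Q3 in ns (1.7 microseconds, see lead-in).
T2_Q3_NS = 1700.0
delay_fraction_of_t2 = total_delay_ns / T2_Q3_NS
```

The contributions sum to the 505 ns stated in the text. Under the assumption above, the delay is roughly 30% of the coherence time of Q3, which illustrates why dynamical decoupling is applied during the wait.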

We have also fully characterized the fidelity of the feed-forward process. After 
preparing the state indicated in the rows of Extended Data Table 2 and performing 
the conditional rotations indicated in the columns of Extended Data Table 2, the 
fidelity of the applied feed-forward process was determined. The diagonal ele- 
ments indicate the fidelity of the desired feed-forward operations occurring with 
the probabilities shown in Extended Data Table 1. The off-diagonal elements 
indicate the fidelity of the unwanted rotations occurring with a smaller probability 
(Extended Data Table 1) but a similar fidelity. 

Efficiency of the teleportation protocol. The total efficiency of the implemented 
teleportation protocol is close to unity. The main limitation is currently set by the 
heralding process of the initial ground state of all qubits. Before each individual 
experimental realization of the teleportation protocol, we apply a measurement 
tone 500 ns long to both resonators, to verify that all the qubits are in their ground 
states. This heralds the ground state with a total efficiency of more than 80% when 
using the parametric amplifiers in the phase-sensitive mode. In the phase-preserving 
mode we herald the ground state with ~65% probability while performing single- 
shot readout. This is in stark contrast to experiments in which optical photons are 
used as qubits, where the maximal reported efficiency is 0.1% (ref. 40). Using active 
initialization schemes for superconducting circuits’, the efficiency of future 
experiments can probably be improved to approach 100% for sufficiently long qubit 
coherence times. 

State and process tomography. To characterize the state transfer from Q1 to Q3 
we performed full process tomography. Performing state tomography on the 
output state |ψ_out⟩ for four different input states |ψ_in⟩ = |0⟩, |1⟩, (|0⟩ + |1⟩)/√2 and 
(|0⟩ − i|1⟩)/√2 (Extended Data Fig. 3), we obtain the process matrix χ through 
linear inversion. In state tomography we measure a state prepared repeatedly with 
a set of orthogonal measurement operators realized by applying either no pulse, a 
π/2-pulse about the x or y axis or a π-pulse to the qubit before measurement of its Z 
eigenstate. 
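
For a single qubit, linear-inversion state tomography amounts to assembling the density matrix from the three measured Pauli expectation values. The following is a minimal sketch of that step and of the fidelity with an ideal pure state. The function names are illustrative, and the mapping from each tomography pulse to a signed expectation value is not specified in the text and is left out.

```python
import math


def density_matrix(x: float, y: float, z: float) -> list[list[complex]]:
    """Linear-inversion estimate rho = (1 + x*X + y*Y + z*Z) / 2.

    x, y and z are the measured expectation values of the Pauli operators.
    """
    return [
        [(1 + z) / 2, (x - 1j * y) / 2],
        [(x + 1j * y) / 2, (1 - z) / 2],
    ]


def fidelity(rho: list[list[complex]], psi: tuple[complex, complex]) -> float:
    """Fidelity <psi|rho|psi> of rho with a normalized pure state psi = (a, b)."""
    a, b = psi
    value = (
        a.conjugate() * (rho[0][0] * a + rho[0][1] * b)
        + b.conjugate() * (rho[1][0] * a + rho[1][1] * b)
    )
    return value.real


# Ideal target (|0> - i|1>)/sqrt(2), which has <Y> = -1.
target = (1 / math.sqrt(2), -1j / math.sqrt(2))

# Hypothetical measured Bloch vector, shortened to length 0.6 by decoherence.
rho_measured = density_matrix(0.0, -0.6, 0.0)
state_fidelity = fidelity(rho_measured, target)
```

In this made-up example the Bloch vector points along the ideal direction with length 0.6, so the fidelity is (1 + 0.6)/2 = 0.8.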

Entanglement verification in a teleportation protocol. The initial shared two- 
qubit state was characterized by performing state tomography immediately after 
its generation, instead of continuing the teleportation protocol. The resulting Bell 
state has a fidelity of 92% and a concurrence of 0.89, demonstrating the entangled 
nature of the state. In addition, it is possible to use the teleportation protocol 
itself as a method for entanglement verification. If the average output-state fidelity 
for six mutually unbiased input basis states is above the classical threshold of 2/3, it 
unambiguously shows that the two parties shared an entangled state, even if one 
has no knowledge about the details of the protocol itself. We performed the 
teleportation experiment with feed-forward also with the six mutually unbiased 
input basis states |ψ_in⟩ = |0⟩, |1⟩, (|0⟩ + |1⟩)/√2, (|0⟩ − i|1⟩)/√2, (|0⟩ − |1⟩)/√2 
and (|0⟩ + i|1⟩)/√2 and found an average output-state fidelity F_s = (69.3 ± 0.1)% 
with a process fidelity F_p = (53.9 ± 0.1)%. 

Error budget. The finite relaxation and dephasing times of our qubits are a source 
of error that limits the output-state fidelity. The fidelity of the measurement of Q3 
through R3 affects the state fidelity of |ψ_out⟩ directly. From the measured 
probabilities of correctly identifying the states |0⟩ and |1⟩ on Q3 we calculate the limit of 
the output-state fidelity through this source of error to be F_s = 94%. In addition, 
the misidentification of the Bell states of Q1 and Q2 leads to an effective dephasing 
of |ψ_out⟩. This limits the fidelity further to F_s = 89% and F_s = 84% for the 
respective cases in which we post-select on one Bell state only and in which we distinguish 
all Bell states with each measurement. Because both of these numbers are about 
7-8% higher than the actually measured fidelities, it is plausible to assign the 
remaining errors to the limited gate fidelities. Determining the gate errors inde- 
pendently shows that we perform single-qubit operations with a fidelity greater 
than 98% and create Bell states with a fidelity of 92% using two-qubit CPHASE gates 
with a process fidelity of 90%. 
Extended Data Figure 1 | Pulse sequence of the teleportation protocol with 
feed-forward. The pulses implement the creation of an entangled pair between 
Q2 and Q3 (blue), the preparation of the state to be teleported on Q1 (green), 
the basis transformation from the Bell to the computational basis and the 
subsequent readout of Q1 and Q2 (red), the dynamical decoupling (DD) pulses, 
conditional rotations and the state tomography on Q3 (green). Gaussian- 
shaped sinusoids represent the microwave pulses applied to the respective 
charge bias lines of the qubits; sinusoids on the resonators represent the readout 
tones; and the squares labelled CPHASE represent the flux pulses that shift the 
frequency of a qubit to implement a controlled-PHASE gate between the marked 
qubits, where the interaction is mediated through the resonator indicated with a 
bar of the same colour as the flux pulse. The inset shows the time used for 
implementing the conditional feed-forward rotations. The total feed-forward 
time is the sum of the ramp-up time of the measurement tone, the integration 
time of the measurement signal and the delay times induced by the FPGA signal 
processing, the AWG trigger and the cables. 
Extended Data Figure 2 | Characterization of the joint readout of Q1 and 
Q2. a, Histogram of the integrated signal quadrature-amplitude amplified 
phase-sensitively when preparing the states |00⟩ (blue), |01⟩ (red), |10⟩ 
(yellow) and |11⟩ (green). b, Scatter plot of integrated (I, Q) quadratures of the 
measurement signal amplified in the phase-preserving mode when preparing 
the states |00⟩ (blue), |01⟩ (red), |10⟩ (yellow) and |11⟩ (green). 
Extended Data Figure 3 | Characterization of the output states. Real (blue) 
and imaginary (red) parts of the reconstructed density matrices of the state 
|ψ_out⟩ for the indicated input states |ψ_in⟩ obtained from state tomography when 
(a) post-selecting data on a 00 outcome of the Bell measurement (b) using 
averaged readout on Q3 while performing fully deterministic teleportation with 
feed-forward. The ideally expected outcomes are indicated with wireframes. 
The state fidelities are indicated in the black boxes. 
Extended Data Table 1 | Success probabilities for the joint readout 


        00    01    10    11 
|00⟩   0.88  0.09  0.01  0.02 
|01⟩   0.11  0.79  0.08  0.02 
|10⟩   0.06  0.10  0.77  0.06 
|11⟩   0.02  0.03  0.08  0.87 


Probabilities of identifying prepared input states (rows) as the indicated output state (columns). Correct 
identifications are diagonal elements; misidentifications are off-diagonal elements. 
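
The mean probability of a correct assignment can be read off the diagonal of the table. The sketch below uses the values as printed; the variable names are illustrative.

```python
# Rows: prepared state; columns: assigned outcome in the order 00, 01, 10, 11.
# Values as printed in Extended Data Table 1.
ASSIGNMENT = {
    "00": [0.88, 0.09, 0.01, 0.02],
    "01": [0.11, 0.79, 0.08, 0.02],
    "10": [0.06, 0.10, 0.77, 0.06],
    "11": [0.02, 0.03, 0.08, 0.87],
}

labels = list(ASSIGNMENT)

# Diagonal entries: probability that each prepared state is identified correctly.
correct = [ASSIGNMENT[state][k] for k, state in enumerate(labels)]
mean_correct = sum(correct) / len(correct)
```

The diagonal mean is about 0.83, in the same range as the (81.8 ± 0.5)% quoted in the Methods. The table entries are rounded to two decimals, so exact agreement is not expected.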


©2013 Macmillan Publishers Limited. All rights reserved 


Extended Data Table 2 | Process fidelities of the feed-forward pulses 


|00⟩ ⊗ 1|ψ⟩ 
|01⟩ ⊗ X|ψ⟩ 
|10⟩ ⊗ Z|ψ⟩ 
|11⟩ ⊗ Y|ψ⟩ 


0.66 


0.73 


xy Z 
0.74 0.64 
0.73 0.65 
0.66 0.65 
0.62 0.75 


After preparing the indicated input states (rows) and performing the conditional operations (columns), 
the fidelity of the expected process is determined (elements of the table). The rows indicate the fidelity 
of the performed rotation conditioned on the measurement result of Q1 and Q2, which occur with the 
probabilities shown in Extended Data Table 1. 
Tunable near-infrared and visible-light 
transmittance in nanocrystal-in-glass composites 


Anna Llordés, Guillermo Garcia, Jaume Gazquez & Delia J. Milliron 


Amorphous metal oxides are useful in optical'’, electronic? * and 
electrochemical devices*’. The bonding arrangement within these 
glasses largely determines their properties, yet it remains a challenge 
to manipulate their structures in a controlled manner. Recently, we 
developed synthetic protocols for incorporating nanocrystals that 
are covalently bonded into amorphous materials*’. This ‘nanocrystal- 
in-glass’ approach not only combines two functional components in 
one material, but also the covalent link enables us to manipulate the 
glass structure to change its properties. Here we illustrate the power 
of this approach by introducing tin-doped indium oxide nanocrystals 
into niobium oxide glass (NbOx), and realize a new amorphous 
structure as a consequence of linking it to the nanocrystals. The 
resulting material demonstrates a previously unrealized optical 
switching behaviour that will enable the dynamic control of solar 
radiation transmittance through windows. These transparent films 
can block near-infrared and visible light selectively and independently 
by varying the applied electrochemical voltage over a range of 
2.5 volts. We also show that the reconstructed NbOx glass has superior 
properties: its optical contrast is enhanced fivefold and it has 
excellent electrochemical stability, with 96 per cent of charge capa- 
city retained after 2,000 cycles. 

Enhanced properties have been demonstrated when nanocrystalline 
precipitates form within a glassy matrix'®"’. These crystalline domains 
can harden bulk glasses”°, introduce optical nonlinearity" or lead to ano- 
malous transparency”. Anecdotal evidence also suggests that ion trans- 
port may be enhanced at crystal—glass interfaces. For instance, when 
LiAlSiO4 was partially crystallized, fast relaxation of ions was observed, 
whereas precipitating TiO2 nanocrystals within mesoporous P2O5 glass 
led to high-power lithium (Li) ion battery electrodes. Structural 
modifications at crystal-glass interfaces have been proposed to explain such 
observations, but no direct evidence has been reported owing to the 
limited structural and compositional control offered by those prepara- 
tion methods. Conventional methods of preparing glass ceramics rely 
on thermally induced phase separation and in situ crystallization pro- 
cesses, which are very complex to control experimentally’’. The time- 
temperature parameter space for selective nanocrystallization is typi- 
cally narrow and strongly influenced by the glass composition, structure 
and thermal history’’. Therefore, determining the heat-treatment con- 
ditions required for selective crystallization of a desired composition 
phase with high control over the interface area becomes not only a 
challenging and tedious task, but also a nearly impossible task for 
complex multicomponent stoichiometries (for example, doped nano- 
crystals). Here we show how to control these characteristics by using 
well-defined colloidal nanocrystals that are post-synthetically embed- 
ded in and chemically linked to glass. 

In our colloidal approach, chemical bonds are made by combining
ligand-stripped Sn-doped In2O3 (ITO) nanocrystals and polyniobate
clusters (POMs) in solution. This process results in POM-stabilized
nanocrystal dispersions (Fig. 1a, Supplementary Fig. 2), from which
films are deposited and then thermally annealed in air, at T = 400 °C
(Fig. 1b). X-ray diffraction (XRD) analysis of as-deposited and annealed
films shows that the crystalline molecular POMs condense to an amorphous
NbOx matrix (Supplementary Fig. 3). In addition, XRD patterns
(Fig. 1c, d) and scanning transmission electron microscopy cross-sectional
images (Fig. 1e, f) confirm the amorphous nature of the NbOx matrix
and that the embedded ITO nanocrystals remain highly crystalline. This
synthetic approach is very flexible, permitting the selection of unusual
combinations of components to tailor functionality. For instance, ITO
nanocrystals and amorphous NbOx were specifically selected for their
spectrally distinct electrochromic response. The volume fraction was
finely tuned over a broad range, from 0% to 69%, by simply adding
additional POM to the aqueous POM-stabilized nanocrystal dispersion
(Fig. 1a). This tunability is key to revealing the reconstruction of the
NbOx glass structure induced by its covalent linkage to ITO nanocrystals.

To evaluate the structure of the glass, the Raman spectrum of a pure
amorphous NbOx film was compared to those of ITO-in-NbOx with
different nanocrystal-glass interfacial areas (Fig. 2). Vibrational bands
of the distorted [NbO6] octahedra that dominate the structure of NbOx
are readily observed in the range 500 cm⁻¹ to 1,000 cm⁻¹ and changes
in their peak positions and relative intensities track with the interfacial
area. This information was used to identify the structural changes in
the glass as a result of linking to the nanocrystals.

Figure 1 | Nanocrystal-in-glass film preparation and structural
characterization. a, POM-stabilized ITO aqueous dispersion, that is,
POM: [N(CH3)4]6Nb10O28. b, ITO-in-NbOx film on a glass substrate. On
annealing, the anionic [Nb10O28]⁶⁻ clusters condense (Supplementary Fig. 3)
and the [N(CH3)4]⁺ counterions thermally decompose. c, Two-dimensional
XRD image of an ITO-in-NbOx film. d, XRD pattern obtained after integrating
along the out-of-plane scattering direction Qz. The indexed diffraction lines
result from randomly oriented ITO nanocrystals (cubic In2O3 bixbyite
structure) whereas the broad peak at Q = 1.85 Å⁻¹ is assigned to amorphous
NbOx. e, f, Low- and high-resolution cross-sectional scanning transmission
electron microscopy Z-contrast images (see Methods).

Figure 2 | Raman analysis probing the reconstruction of a NbOx glass
matrix when linked to nanocrystals. a, Raman spectra of pure NbOx (black)
and ITO-in-NbOx nanocomposite (red) with 64 vol.% ITO. b-e, Fitted Raman
spectra (experimental data in red, sum of fits in black). From bottom to top
are pure NbOx followed by nanocomposites with increasing ITO volume
fraction (0.48, 0.52 and 0.64). Deconvolved peaks (grey) arise non-specifically
from ITO, NbOx and the underlying glass substrate. Blue and green
peaks specifically arise from NbOx (bridging oxygens and terminal
oxygens, respectively).

The topological structure of NbOx glass is characterized by how the
[NbO6] octahedral units are connected, namely through vertices and
along edges, as well as by the degree of connectivity in the disordered
network. Given the higher electrostatic repulsions in edge-sharing
topologies compared to vertex-sharing, distinctive vibrational modes for
the bridging oxygens can be observed. By deconvolving these bridging
oxygen bands (550 cm⁻¹ to 750 cm⁻¹), we find that highly distorted
edge-sharing [NbO6] are prevalent in the nanocrystal-in-glass films,
whereas pure NbOx contains more regular, vertex-sharing arrangements
(Fig. 2, Table 1, Supplementary Fig. 4). In addition, the relative
intensity of Raman peaks at higher frequencies (750 cm⁻¹ to 950 cm⁻¹)
increases at higher nanocrystal loading, suggesting reduced connectivity
of the glassy network. In particular, the highest-frequency peak,
900 cm⁻¹ to 925 cm⁻¹, is assigned to terminal Nb-O bonds. The ratio
between these terminal bonds and bridging ones is greater in the
nanocomposites (Fig. 2, Table 1, Supplementary Fig. 4), which we attribute
to internal interruption of the polymerized network. Generally, the
degree of connectivity in oxide glasses can be reduced by addition of
network modifiers or glass intermediates. In our case, we suggest that
indium (In) and tin (Sn) from the ITO nanocrystals diffuse locally into
the NbOx glass during thermal processing. There they disrupt the
glassy network, inducing chain breaking and thus reducing the network
connectivity. Indeed, XRD patterns of films processed at increasing
temperatures (400 °C to 550 °C) showed broader ITO peaks
(Supplementary Fig. 5b), pointing to progressive interfacial dissolution of
ITO in the NbOx matrix. The presence of a ternary InNbO4 phase following
high-temperature annealing (>550 °C) (Supplementary Fig. 5c, d)
further supports the idea that In and probably Sn diffuse into the
NbOx matrix adjacent to the nanocrystals. Hence, the structure of
the NbOx glass is profoundly altered by covalent linking to ITO
nanocrystals, becoming highly distorted, less interconnected and
interfacially doped (Fig. 3).

Table 1 | Peak-fitting analysis of Raman spectra

The consequence of linking ITO nanocrystals within a NbOx glass
matrix is that a unique electrochromic optical switching functionality
can be realized (Fig. 4). As explained below, this new optical behaviour
not only reflects the individual properties of the two components, but
also is quantifiably superior as a result of the reconstruction of the
amorphous material.

Amorphous metal oxides are leading candidates for dynamically
controlling solar radiation transmitted through ‘smart’ windows by
means of their electrochromic properties. Electrochromism is a reversible
change of optical transmittance in response to electrochemical
charging and discharging. After more than 40 years of research in
electrochromics, spectral selectivity, that is, independent modulation
of visible and near-infrared (NIR) radiation, is still considered a ‘holy
grail’ for reducing the energy needed to light and thermally regulate
building interiors. For instance, others have proposed combining
thermochromic and electrochromic materials to modulate NIR and visible
light, respectively, although fully integrated devices based on this
concept have yet to be reported. We took a step towards this goal through
our recent demonstration that charged ITO nanocrystals selectively
block NIR light through a plasmonic electrochromic effect (Fig. 4f).
In contrast, amorphous transition metal oxides such as NbOx modulate
mostly visible light under electrochemical bias (Fig. 4a). Now, by
linking these two materials and controlling the applied voltage, we find
that the transmittance can be modulated in both spectral bands
(Fig. 4a-g, Supplementary Fig. 1). The nanocrystal-in-glass coating
switches progressively between three optical states: fully transparent
(at 4 V versus Li), selectively NIR blocking (around 2.3 V) and
broadband blocking of visible and NIR (1.5 V). Thus, solar radiation can now
be dynamically modulated with spectral selectivity (Supplementary
Fig. 1). The modulation of NIR light derives from the free electrons in
the ITO, whose concentration reaches approximately 1.9 × 10²¹ cm⁻³
at 2.3 V (Supplementary Information and Supplementary Fig. 9).
Ultimately, the modular construction of our materials will facilitate even
greater modulation of NIR transmittance as new plasmonic nanocrystals
are developed with a higher free-electron concentration and a
broader dynamic optical range.

Remarkably, the modest visible-light modulation of amorphous NbOx
(Fig. 4a) was greatly exceeded by that of the nanocrystal-in-glass
composites (Fig. 4b-d), indicating that the reconstructed NbOx glass is far
more electrochemically active. The optical contrast in the visible range
monotonically increased with nanocrystal content up to ~43% ITO,
reaching a value that is five times greater than that of pure NbOx (Fig. 4h).
At higher nanocrystal loading, the contrast decreased again, indicating
that the enhancement is maximized when the average nanocrystal
spacing is around 0.7 nm (Supplementary Information). At this high
interface density (0.54 nm⁻¹), the structure of the NbOx is rearranged
throughout, effectively resulting in a new polymorph that exhibits
functional characteristics distinct from those of the bulk glass, and that in
fact more closely resembles the stronger electrochromism seen in the
high-temperature pseudohexagonal crystalline phase of Nb2O5 (known as
the TT-phase) (ref. 26).

Figure 3 | ITO nanocrystals covalently linked to amorphous NbOx. The
[NbO6] octahedral units are shown in green and the ITO nanocrystals are blue.
In the inset, the niobium atoms are located at the centre of the green octahedra,
oxygen atoms are red and indium atoms are blue.
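The interface density quoted above can be checked with a simple geometric estimate. The sketch below assumes monodisperse, non-touching spherical nanocrystals, so that the interface area per unit film volume is the volume fraction multiplied by the surface-to-volume ratio of one sphere, 6/d. This model and the function name are illustrative assumptions; the calculation actually used is described in the Supplementary Information and may differ.

```python
def specific_interfacial_area(volume_fraction: float, diameter_nm: float) -> float:
    """Nanocrystal-glass interface area per unit film volume, in nm^-1.

    Assumes monodisperse, non-touching spheres (an illustrative model,
    not necessarily the calculation used in the Supplementary Information).
    """
    # One sphere has area/volume = 6/d; scale by the volume fraction it fills.
    return volume_fraction * 6.0 / diameter_nm


area_at_peak = specific_interfacial_area(0.43, 4.8)  # ~43 vol.% ITO, 4.8 nm diameter
area_at_max = specific_interfacial_area(0.69, 4.8)   # 69 vol.% ITO, 4.8 nm diameter
print(f"{area_at_peak:.2f} nm^-1, {area_at_max:.2f} nm^-1")
```

For 43 vol.% ITO the estimate is about 0.54 nm⁻¹, matching the value in the text. For 69 vol.% it gives about 0.86 nm⁻¹, somewhat below the 0.92 nm⁻¹ quoted in the Fig. 4 caption, so the simple sphere model should be read only as a rough consistency check.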

The reconstructed, nanocrystal-linked amorphous NbOx network is
also highly stable to insertion and extraction of Li ions. The
electrochemical cycling stability exceeds that of either individual component;
only 4% charge capacity was lost after 2,000 cycles (between 4 V and 1.5 V),
whereas pure NbOx and especially ITO nanocrystal films degraded
substantially (20% and 85% capacity losses, respectively)
(Supplementary Fig. 7a). In agreement with the electrochemical stability of
ITO-in-NbOx composites, the dual-band optical modulation characteristics
were maintained after cycling (Supplementary Fig. 7b). We suggest
that the enhancements in both optical contrast and stability are related
to the more open network structure of the reconstructed glass, which
facilitates ion insertion and extraction and helps to relax the stress
induced by these processes. Consistent with this hypothesis, the charge
capacity increases proportionally with the optical contrast
(Supplementary Fig. 8).

Linking undoped In2O3 nanocrystals to NbOx, instead of Sn-doped
In2O3 nanocrystals, similarly enhanced the optical contrast arising
from the matrix (Fig. 4h, Supplementary Fig. 10), which eliminates
the possibility that potentially electrochemically active Sn species could
play a part in such enhancement. Instead, this result further confirms
that the optical contrast enhancement is related to a structural
reconstruction of the NbOx matrix as a consequence of the covalent linkage
to the embedded nanocrystals. Very probably, similar rearrangements
of the glass structure could be involved in the enhanced ionic transport
properties previously reported in diverse nanocrystal-in-glass
materials made by a variety of methods. Therefore, reconstructing glass
via nanocrystal linking could represent a general strategy for the
manipulation of ion transport.

Figure 4 | Tunable dual-band solar control and optical contrast
enhancement in nanocrystal-in-glass films. a-g, Transmittance spectra
under applied electrochemical voltage (4 V in red, 2.3 V in blue and 1.5 V in
cyan, versus Li/Li⁺), of ITO-in-NbOx with different ITO nanocrystal content.
Pure NbOx (a) and pure ITO nanocrystal films (f) are shown as reference. The
specific interfacial area was tuned from 0 nm⁻¹ to 0.92 nm⁻¹ by varying the
ITO nanocrystal percentage by volume between 0% and 69% at a constant size
of 4.8 nm ± 0.8 nm, chosen to maximize the electrochromic response. The
atomic percentage of Sn in ITO was 12.2 ± 0.1% and film thickness was
150 ± 35 nm. g, Transmittance spectra of a typical ITO-in-NbOx film, with
intermediate voltages shown in grey. h, Change in absorbance (ΔA) at
λ = 500 nm between 1.5 V and 4 V, versus volume fraction of ITO nanocrystals
(fITO). ΔA is normalized by NbOx volume fraction. The ΔA of a control sample
with undoped In2O3 nanocrystals is also plotted (star symbol). Photographs of
charged films with fITO = 0 and fITO = 0.34 are inset.

In essence, this nanocrystal-in-glass approach represents the next
step in the evolution of the concept of linking building units together to
construct new functional materials. Analogously, organic molecules
are held together by weak, dynamic bonds to form supramolecular
assemblies, metal ions are covalently bonded to organic struts to form
metal-organic frameworks and nanocrystals are linked directly to
each other in mesoporous architectures. We have shown that when
nanocrystals are covalently bonded to glass, the linking plays a far more
active part than in these earlier constructs: the glass structure can be
profoundly changed, generating new amorphous structural motifs
with distinct functionality.


METHODS SUMMARY 


Nanocrystal-in-glass (ITO-in-NbOx) composite films were prepared from aqueous
colloidal solutions, containing polyoxometalate clusters bonded to the nanocrystals’
surface. On solution deposition (spin coating) and thermal annealing (400 °C in
air), polyoxometalates condense into an amorphous oxide matrix, which remains
covalently linked to the embedded nanocrystals. The volume fraction of
nanocrystals in the nanocomposite films was tuned by varying the polyoxometalate
molar concentration in the initial colloidal solution. XRD and scanning
transmission electron microscopy were used for structural characterization of the
films at different stages of the thermal process. In addition, films with different
interfacial densities were analysed by Raman spectroscopy to systematically follow
structural changes in the amorphous matrix induced by the presence of
nanocrystals. In parallel, the electrochromic properties were studied by means of
spectroelectrochemical measurements. Transmittance spectra were acquired in situ,
under applied voltage, by placing the film as working electrode in a liquid
spectroelectrochemical cell (Li foils as counter/reference electrodes and
LiClO4/propylene carbonate as electrolyte).


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Synthesis of POMs and ITO nanocrystals. We used two different POMs; the
hexaniobates [Nb6O19]⁸⁻ and decaniobates [Nb10O28]⁶⁻ were both prepared using
a solution precipitation method with tetramethylammonium, [N(CH3)4]⁺, as
counterion. We recently reported the synthesis and detailed characterization of
these POMs. Briefly, to synthesize decaniobate clusters, 3.19 g (10 mmol) of
niobium ethoxide (Nb(OEt)5, 99.99%, Alfa Aesar) were added to a flask containing
25 ml ethanol. Then, 1.81 g (10 mmol) tetramethylammonium hydroxide
(N(CH3)4OH·5H2O, 97%, Aldrich) were added to the flask and the reaction mixture
was refluxed overnight, then cooled to room temperature and centrifuged. The
resulting solid product was washed with H2O and precipitated with acetone (three
times). After drying overnight under vacuum at room temperature, a crystalline
white powder was collected (47% reaction yield). XRD of the resultant powder as
well as thermogravimetry coupled to mass spectrometry confirmed the expected
chemical formula: [N(CH3)4]6Nb10O28·6H2O (ref. 9). The hexaniobates,
[Nb6O19]⁸⁻, were synthesized by dissociating decaniobate clusters at pH 14. The
reaction was performed at room temperature by mixing 0.21 mmol
[N(CH3)4]6Nb10O28·6H2O and 1.48 mmol N(CH3)4OH·5H2O in a flask containing
10 ml EtOH/H2O (1:1). After 2 h stirring, the solid product, with the expected
chemical formula [N(CH3)4]8Nb6O19·15H2O, was precipitated with acetone and
dried overnight under vacuum. Given that the pH stability of hexaniobates is
within the range of 12.5-14, we avoided further washing, which would induce
their condensation to decaniobate clusters (stable at neutral pH) (ref. 31).
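The reagent quantities quoted for the decaniobate synthesis can be cross-checked against their molar masses. The following is a minimal sketch of that arithmetic; the atomic masses are standard rounded values, and the helper names are illustrative rather than taken from the paper.

```python
# Standard atomic masses in g/mol (rounded to three decimals).
ATOMIC_MASS = {"H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999, "Nb": 92.906}


def molar_mass(composition: dict) -> float:
    """Molar mass in g/mol from an {element: count} composition."""
    return sum(ATOMIC_MASS[element] * count for element, count in composition.items())


def millimoles(mass_g: float, composition: dict) -> float:
    """Amount of substance in mmol for a given mass in grams."""
    return 1000.0 * mass_g / molar_mass(composition)


NB_ETHOXIDE = {"Nb": 1, "C": 10, "H": 25, "O": 5}  # Nb(OC2H5)5
TMAOH_5H2O = {"N": 1, "C": 4, "H": 23, "O": 6}     # N(CH3)4OH·5H2O

mmol_nb = millimoles(3.19, NB_ETHOXIDE)
mmol_tmaoh = millimoles(1.81, TMAOH_5H2O)
print(f"Nb(OEt)5: {mmol_nb:.2f} mmol, TMAOH·5H2O: {mmol_tmaoh:.2f} mmol")
```

Both amounts come out within about 0.3% of 10 mmol, consistent with the 1:1 molar ratio stated in the text.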

ITO nanocrystals, with sizes ranging from 5 nm to 15 nm and atomic percentage
of Sn doping from 4 at.% to 12 at.%, were synthesized by following established
colloidal methods (ref. 32). The size of the nanocrystals was determined by
collecting statistics in bright-field transmission electron microscope images, and
atomic doping concentration was assessed by inductively coupled plasma atomic
emission spectroscopy. The ITO-in-NbOx composite films that we used for
electrochromic properties testing and Raman characterization contained ITO
nanocrystals with Sn at.% of 12.2 ± 0.1 and diameter 4.8 nm ± 0.1 nm.

Linking process for POM-stabilized nanocrystals. As-synthesized ITO
nanocrystals were stripped of their coordinated organic ligands and their naked,
metal-adatom-rich surfaces were further functionalized with POMs. The chemical
link between POMs and the nanocrystals is made by bonding nucleophilic POM
oxygens to In/Sn adatoms in ITO.

In the ligand-stripping process, a two-phase mixture was prepared in a vial by
combining equal volumes of N,N-dimethylformamide (DMF) and organic-capped
ITO (~60 mg ml⁻¹) dispersed in hexane. Then, ~7 mg nitrosonium
tetrafluoroborate (NOBF4, 97%, Acros) were added and the resulting mixture was
either vigorously stirred or sonicated. After 5-10 min, ITO nanocrystals were
quantitatively transferred to the polar DMF phase as a result of their organic-free
surface, now weakly coordinated to the solvent (DMF) molecules. These naked
nanocrystals were precipitated with toluene and the supernatant removed by
centrifugation. After further purification with toluene and DMF (three washings),
the nanocrystals were dispersed in DMF (final concentration, 60 mg ml⁻¹). Next,
1 ml of an aqueous solution of hexaniobate clusters, [N(CH3)4]8Nb6O19
(concentration varies depending on the targeted final volume fraction, typically
200 mg ml⁻¹), was added to 1 ml of the DMF ITO dispersion, which instantly turned
opaque. After vigorous stirring (30 min), the slurry became clear as a result of
POMs bonding to the naked ITO nanocrystals, which thereby became perfectly
soluble in water (Fig. 1a). The colloidal solution was washed three times with
DMF/H2O to ensure complete removal of DMF molecules. The resultant POM-ITO
pellet was redispersed in ultrapure water. Note that hexaniobates were found to
be more effective than decaniobates in coordinating the bare nanocrystal surfaces
to form stable aqueous dispersions. We ascribe this behaviour to the higher
charge density of hexaniobates, which facilitates the surface coverage and extent
of linking to the nanocrystal.

Addition of excess POM (that is, decaniobates) to the POM-ITO solution allowed
us to vary the volume fraction of NbOx glass in the final ITO-in-NbOx film.
Aqueous POM-ITO solutions are perfectly stable and no ageing was observed
two years after preparation. Ethanol was added as co-solvent to increase
the viscosity of the solution and thus improve the quality of the deposited film.

Preparation of ITO-in-NbOx nanocrystal-in-glass films. POM-ITO colloidal
solutions, in water/ethanol (1:1), were deposited by spin coating onto a substrate
(such as Si or ITO-coated glass) which had been previously cleaned and treated
with ultraviolet-ozone to ensure a hydrophilic surface. As-deposited POM-ITO
films were then annealed at 400 °C for 20 min, in air, using a rapid thermal
annealing furnace at 10 °C s⁻¹.

XRD of POM-ITO and ITO-in-NbOx. The atomic structure of as-deposited
POM-ITO (Supplementary Fig. 3) and annealed ITO-in-NbOx films (Fig. 1c, d,
Supplementary Fig. 5) was assessed by XRD with synchrotron radiation
(λ = 0.974 Å), at the Stanford Synchrotron Radiation Lightsource (SSRL),
beamline 11-3. Acquisitions were performed in grazing-incidence mode (incidence
angle of 0.12°) and the scattered X-rays collected with an area detector.

Transmission electron microscopy of POM-ITO and ITO-in-NbOx.
Transmission electron microscope images of the POM-ITO colloidal solutions
(Supplementary Fig. 2) were acquired on a JEOL-2100 with a LaB6 filament at 200 kV
and equipped with a Gatan camera. Samples for transmission electron microscope
analysis were prepared by drying a drop of POM-ITO aqueous solution on the
surface of a carbon-coated copper grid, which had been previously treated with
ultraviolet-ozone.

Cross-sectional scanning transmission electron microscopy images of
ITO-in-NbOx films (Fig. 1e, f) were acquired in a Nion UltraStem operated at 100 kV
and equipped with a Nion aberration corrector. A high-angle annular dark field
(HAADF) detector allowed us to record incoherent Z-contrast images, in which
the intensity of atom columns directly reflects their mean square atomic number
(Z). Specimens for scanning transmission electron microscopy were prepared by
conventional methods, that is, grinding, dimpling and Ar ion milling.

Scanning electron microscopy and energy dispersive spectroscopy. Scanning
electron microscope imaging (Supplementary Fig. 6) was carried out on a Zeiss
Gemini Ultra-55 microscope, using beam energies of 5-10 kV and an In-Lens
detector. Film thickness was determined by cross-sectional imaging. An inbuilt
energy dispersive spectroscopy detector was used for elemental analysis. The
volume fraction of each component in the final nanocrystal-in-glass films was
estimated from the measured metal atomic ratios and their bulk densities.

Raman spectroscopy of ITO-in-NbOx films. The bonding arrangement of the
amorphous NbOx matrix was studied by Raman spectroscopy (Fig. 2), using a
Horiba LabRAM Aramis instrument with confocal aperture. All Raman spectra
were obtained with a ×100 microscope objective (numerical aperture of 0.90),
17.7 mW of laser power at an excitation wavelength of 532 nm, and an acquisition
time of 300 s. The spectral resolution was 1.5 cm⁻¹ and the laser spot size was
~721 nm. Several spots were examined on each sample and showed similar spectra.
Raman peaks from the substrate (ITO-sputtered glass slides) appeared in a different
spectral region from those of NbOx, allowing accurate data analysis and interpretation.
Peak-fitting analysis was carried out in OriginPro software.
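The quoted laser spot size is consistent with a diffraction-limited estimate. The sketch below assumes the spot is taken as the Airy-disk diameter, 1.22 λ/NA; the text does not state how the value was obtained, so this formula is an assumption that happens to reproduce it.

```python
def airy_spot_diameter_nm(wavelength_nm: float, numerical_aperture: float) -> float:
    """Diffraction-limited spot diameter (first Airy minimum): 1.22 * lambda / NA."""
    return 1.22 * wavelength_nm / numerical_aperture


# 532 nm excitation through the x100 objective with NA = 0.90.
spot_nm = airy_spot_diameter_nm(532.0, 0.90)
print(f"{spot_nm:.0f} nm")
```

With these inputs the estimate is about 721 nm, in agreement with the spot size given above.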

Optical switching and electrochemical measurements of ITO-in-NbOx films.
Optical switching properties at different voltages (Fig. 4) were studied in a
home-built spectroelectrochemical cell, in which the ITO-in-NbOx film was placed as
the working electrode and in situ transmission visible-NIR spectra were recorded as
functions of the applied potential (1.5 V to 4 V versus Li/Li⁺). The
spectroelectrochemical cell had a three-electrode configuration, with Li foils as
counter and reference electrodes, the electrolyte being 0.1 M LiClO4 in anhydrous
propylene carbonate.

The electrochemical stability was evaluated by cyclic voltammetry, in which the
voltages were swept between the potential limits at 30 mV s⁻¹ 2,000 times.
Spectroelectrochemical and cyclic voltammetry measurements were performed in an
argon glove box with an ASD Quality Spec Pro visible-NIR spectrometer and a
Bio-logic VSP potentiostat. ITO-in-NbOx films were deposited on conducting glass
substrates (that is, ITO-sputtered glass slides) to ensure good in-plane conductivity.


31. Villa, E. M. et al. Reaction dynamics of the decaniobate ion
[HxNb10O28](6−x)− in water. Angew. Chem. Int. Edn 47, 4844-4846 (2008).

32. Choi, S.-I., Nam, K. M., Park, B. K., Seo, W. S. & Park, J. T. Preparation and optical
properties of colloidal, monodisperse, and highly crystalline ITO nanoparticles.
No increase in global temperature variability despite
changing regional patterns


Chris Huntingford, Philip D. Jones, Valerie N. Livina, Timothy M. Lenton & Peter M. Cox


Evidence from Greenland ice cores shows that year-to-year temperature
variability was probably higher in some past cold periods, but
there is considerable interest in determining whether global warming
is increasing climate variability at present. This interest is
motivated by an understanding that increased variability and
resulting extreme weather conditions may be more difficult for
society to adapt to than altered mean conditions. So far, however,
in spite of suggestions of increased variability, there is considerable
uncertainty as to whether it is occurring. Here we show that
although fluctuations in annual temperature have indeed shown
substantial geographical variation over the past few decades, the
time-evolving standard deviation of globally averaged temperature
anomalies has been stable. A feature of the changes has been a
tendency for many regions of low variability to experience increases,
which might contribute to the perception of increased climate
volatility. The normalization of temperature anomalies creates the
impression of larger relative overall increases, but our use of absolute
values, which we argue is a more appropriate approach, reveals little
change. Regionally, greater year-to-year changes recently occurred
in much of North America and Europe. Many climate models predict
that total variability will ultimately decrease under high greenhouse
gas concentrations, possibly associated with reductions in sea-ice
cover. Our findings contradict the view that a warming world will
automatically be one of more overall climatic variation.

A recent analysis of the NASA Goddard Institute for Space Studies
gridded temperature data set suggests that seasonal mean temperatures
are starting to show increased variability, with a particularly large
difference between the 1970s and the 1980s. The analysis first calculates
for each geographical position the standard deviation of temperature
for the period 1951-1980. It then divides each local time series of
anomalies away from a mean (also calculated for 1951-1980) by those
standard deviation values, and hence provides a normalization of
year-to-year variations on to a single distribution for all points across the
globe. When the distribution is presented for different decades, it is
shown to widen, particularly when comparing the periods before and
after 1980, indicating major changes in climate variability. Some
features of climate shifts around that time have been linked to changes in
the Pacific Decadal Oscillation. In this work, we perform an analysis
similar to that in ref. 2, using the European Centre for Medium-Range
Weather Forecasts (ECMWF) ERA-40 data set that starts in 1958,
and generate analogous distribution curves (Fig. 1a). The change in
standard deviation between the distribution for 1958-1970 and that for
1991-2001 (Fig. 1a) is an increase of 43% (Supplementary Table 1, first
row). There are seasonal and hemispheric variations in these
distributions (Supplementary Fig. 1 and Supplementary Table 1), although the
common feature of generally increasing variance remains present in all
cases. All statistical details are described in Methods.

One contribution to the increase over time of the standard deviations
of the distributions, when using the algorithm leading to Fig. 1a,
is geographically diverse rates of mean warming. To account for this,
we reconsider the curves of Fig. 1a and, for each of the four periods,
instead calculate anomalies relative to the average of each period (as
opposed to always relative to 1958-1980). Division is again made by
local standard deviations calculated for before 1980, but now for these
new anomalies, giving the revised curves in Fig. 1b. Although this
reduces the changes to the distributions between periods, there
remains an increase of 22% in standard deviation between 1958-1970
and 1991-2001 (Supplementary Table 2, first row).

To understand further, we instead derive at each ERA-40 data set grid 
box the yearly temperature anomaly (detrending with an 11-year local 
running mean) and then calculate a time-evolving, area-weighted, glo- 
bal, annual standard deviation from these. This time series of total 
variability is the black curve in Fig. 1c. Although there is evidence of 
some increase around 1980, this is an order of magnitude less than 
would be implied by ref. 2 and our Fig. 1a or Fig. 1b. The mean of the 
black curve in Fig. 1c before and after 1980 shows only a 2% change. We 
then normalize the value at each point by the local standard deviation 
calculated up to 1980, and re-derive the yearly, global standard devi- 
ation values based on this. We observe these values to increase more 
(Fig. Ic, red curve): the mean after 1980 is 16% greater than the mean 
before 1980, which is similar to the change calculated from Fig. 1b. The 
green curve in Fig. 1c is the ratio of the red and black curves. In Fig. 1d, 
we plot, for local standard deviations of the anomalies from detrending, 
calculated before and after 1980, the number of grid boxes in which 
the standard deviation subsequently increases (red curve) or decreases 
(green curve) after 1980. The curves are not similar and there is a tend- 
ency for low-variance regions to have a higher chance of becoming 
more variable. We offer this as an explanation why, overall, normalized 
variability has increased more. In terms of relative magnitude, an 
increase in variance for regions of low standard deviation will outweigh 
a similar decrease for a region of high variation. Absolute global standard
deviation for the ERA-Interim period (1984-2006) is also plotted
(Fig. 1c, cyan curve; overlapping the black curve for 1984-1996), and, 
rather than increasing, is actually found to be decreasing in the most 
recent years. 
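The asymmetry described above can be illustrated with a toy calculation. This is only a sketch and not the paper's computation: it assumes two equal-area regions with invented standard deviations (0.2 K and 1.0 K), and approximates the global standard deviation as the root of the mean local variance. The function name and all numbers are hypothetical.

```python
import math

def global_sd(local_sds):
    """Root of the mean local variance (equal-area regions assumed)."""
    return math.sqrt(sum(s * s for s in local_sds) / len(local_sds))

# Hypothetical local standard deviations (K) before and after 1980.
before = [0.2, 1.0]  # low-variance region, high-variance region
after = [0.3, 0.9]   # +0.1 K in the low region, -0.1 K in the high region

# Fractional change in absolute global variability.
abs_change = global_sd(after) / global_sd(before) - 1.0

# Normalized variability: each region is divided by its own pre-1980 value,
# so every region starts at 1 and the low-variance region's rise is magnified.
norm_before = [b / b for b in before]
norm_after = [a / b for a, b in zip(after, before)]
norm_change = global_sd(norm_after) / global_sd(norm_before) - 1.0

print(f"absolute change:   {abs_change:+.1%}")
print(f"normalized change: {norm_change:+.1%}")
```

With these invented inputs the absolute measure falls slightly while the normalized measure rises, which is the qualitative behaviour the text attributes to low-variance regions becoming more variable.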

Nevertheless, changes to regional variability have occurred. Sup- 
plementary Table 2 (and Supplementary Fig. 2) suggests that the 
largest changes in standard deviation, by season and hemisphere, are 
in the Northern Hemisphere spring and summer and the Southern 
Hemisphere autumn and summer. There is already some evidence that 
increased variability for Europe, imposed additionally on general 
warming, is increasing the frequency of European summer heatwaves.
Supplementary Fig. 3 shows the average zonal changes to standard
deviation before and after 1980, with prominent increases over mid
latitudes for both hemispheres. We note observations from the hydrological
cycle indicating that storm tracks have moved polewards in recent
decades, along with Hadley cell expansion. To confirm more
directly than by use of gridded re-analysis data that major temperature
variability changes are occurring, we analyse weather station data that
contributes to the University of East Anglia Climatic Research Unit 
(CRU) climatology. We detrend each station data set of calculated yearly 
mean temperatures and derive a time-evolving series of standard deviations.
To illustrate behaviour before and after 1980, we scale such that
each series has a mean of unity between 1975 and 1985. The spread of
curves in Supplementary Fig. 4a is large before and after 1980, confirming
significant change. In Supplementary Fig. 4b, c, we show only stations in
mid Europe and North America; here variability predominantly
increased during the 1970s and 1980s, but less so in more recent years.

Figure 1 | Measured changes in temperature variability. a, Normalized
probability density functions for four periods of anomalies in the ECMWF
ERA-40 data set. Anomalies are calculated relative to 1958-1980 mean values
and then divided by their local standard deviation for that period. b, Anomalies
calculated relative to the mean for each of the four periods and then divided by
the standard deviation of these anomalies for 1958-1980. c, Global, yearly
spatial average of standard deviations of anomalies from local, 11-yr
detrending. Black curve, ERA-40; cyan, ERA-Interim (K). The only difference
between the red curve and the black is that, for the former, the anomalies are
divided by their local standard deviation for 1963-1980. The green curve is the
ratio of the red and black curves. The black and red dashed lines are the average
values of the correspondingly coloured curves to 1980. The green dash-dot
lines are mean values of the green curve before and after 1980. d, Grid-box
(area-weighted, detrended data) occurrences, for bins of size 0.01 K, showing
increases (red) and decreases (green) in standard deviation in 1981-1996
compared with 1963-1980. e, Percentage change in standard deviation of ERA-40
data, again comparing the periods after and before 1980: yellow, orange and
red indicate more variability after 1980.

Figure 1e shows geographical change in standard deviation based on
the ERA-40 data set. Each grid point is the percentage change in 
standard deviation in detrended annual temperature, comparing again 
the periods before and after 1980. Of particular interest are the very 
large changes in variability: in many places there are changes of more 
than 25%, both increases after 1980 (red) and decreases (dark blue). 
North America and Europe are once more seen to have generally 
experienced increases in standard deviation, although the magnitude 
of changes is seasonally dependent (Supplementary Fig. 5; for much of 
Europe the largest increases occur during December-May). Of note 
is the strong spatial structure of the map in Fig. 1e, which has large
coherent regions of change. 

Hovmöller plots provide time-evolving spatial (zonal) information.
Figure 2 complements the map in Fig. 1e and shows, for each year and
10° latitude band, standard deviation as a percentage of its value for 
1968-1980, calculated for both the ERA-40 period and the ERA- 
Interim period. The increased mid-latitude standard deviation values 
are apparent, along with a general tropical decrease. There is again a 
suggestion, towards the end of the ERA-Interim data, that the zones of 
increased standard deviation are starting to subside, especially after 
1995. 

To assess whether the observed changes in variability are transitory, 
we analyse projections made using general circulation models. Figure 3a 
shows the global, time-evolving standard deviation from one such 
model, HadGEM2-ES, using the same method as that giving the black
and cyan curves in Fig. 1c for ECMWF data. For this model, the
Coupled Model Intercomparison Project Phase 5 (CMIP5) database
has four ensemble members (all shown in Fig. 3a) driven first with all
historical climatic forcings and then by the atmospheric greenhouse gas
changes prescribed in representative concentration pathway RCP8.5.
The mean of the 11-yr moving averages of each ensemble member is the 
thick black curve. This predicts major ongoing reductions in variability 
during the twenty-first century, and suggests that there was a small 
increase in variability during the 1980s and 1990s. The thick red curve 
is the mean of the global standard deviation for ERA-40/ERA-Interim 
(moving average of black and cyan values in Fig. 1c), and shows that 
global variability first increased slightly and then decreased.

Figure 2 | Zonal changes in measured temperature variability. Hovmöller
plots of time-evolving standard deviation in 10° latitudinal zonal bands.
a, ERA-40 data set; b, ERA-Interim data set. All data shown as percentage
change relative to zonal averages for 1968-1980.

Figure 3b-e shows the percentage changes in yearly zonal standard
deviation for the four HadGEM2-ES ensembles with historical forcings 
and RCP8.5, compared with their mean to 1980. Eventual variance 
reduction occurs across most latitudinal bands, and all changes have 
spatial structure and some evidence of temporal persistence beyond the 
11 yr implicit in the moving averages (Methods), although regional
timings differ between simulations. There are predictions of transitory
occurrences of increased variability (orange/red patches) in recent decades,
but the timings, positions and magnitudes of these variations vary
markedly between ensemble members.

[Figure 3 panels: a, f, g, global s.d. (K) against year; b-e, latitude north against year.]

Figure 3 | Projected changes in temperature variability under the RCP8.5
scenario. a, Global, spatially averaged standard deviation in yearly temperature
anomalies (from 11-yr detrending) for four HadGEM2-ES simulations, driven
with all historical forcings and then the RCP8.5 scenario. The mean of the 11-yr
moving averages of the ensemble members is the thick black curve; dashed line
shows the average of the black curve up to 1980. The thick red curve is the 11-yr
moving average of ERA-40 and ERA-Interim global standard deviations, and
for comparison is scaled to have the same average value as the model ensemble
average before 1980 (black dashed line). b-e, Percentage changes in zonal
standard deviation relative to average before 1980, for the four simulations
shown in a. The colour scale is as in Fig. 2. f, Thick black curve as in a, but
averaged across CMIP5 models (for individual models, see Supplementary
Fig. 6). Then each model is scaled such that its pre-1980 average is the same as
the all-model mean (dashed line), which allows comparison of relative changes
(turquoise model spread). g, As in f, except local anomalies use
31-yr detrending, and data plotted are 31-yr moving averages of global
standard deviations.

Figure 3f shows the mean, time-evolving, global standard deviation 
for all 17 climate models with historical-plus-RCP8.5 simulations in 
the CMIP5 database (thick black curve). Individual model standard
deviation curves are shown in Supplementary Fig. 6, and approxi- 
mately two-thirds show an eventual definite decrease in average vari- 
ability. Scaling each model’s time-evolving standard deviation curve to 
have the same average value (same as dashed line) before 1980 allows 
relative change to be seen more clearly. The bounds of these scaled 
curves also highlight this cross-model decrease towards the end of the 
twenty-first century (turquoise area). These multimodel findings are 
robust to changes in the timescale selected for both detrending by local 
means and subsequent calculation of global, moving averages of stand- 
ard deviation. Figure 3g and Supplementary Fig. 7 have features in 
common with Fig. 3f and Supplementary Fig. 6 when instead using 
31-yr timescales. 

One possible reason for the reduction in global temperature vari- 
ability is the decrease in Arctic sea-ice cover in these model projections. 
Sea-ice extent strongly affects multiple features of the global climate 
system. Changes in it have been argued to amplify, in particular,
Northern Hemisphere temperature anomalies through sea-ice albedo
feedbacks. Reductions in sea-ice thickness and age have reduced
the timescale of changes in sea-ice extent, acting to increase temporarily
the amplification of interannual temperature anomalies for
mid latitudes. Ultimately, however, major loss of summer sea ice in
extreme global warming scenarios will lead to the removal of a possible 
enhancement of interannual temperature anomalies. In Fig. 4, we plot, 
for the CMIP5 models, our global standard deviation values against 
sea-ice extent. This shows a general decrease in standard deviation 
for less sea ice. When presented as individual plots for each model 
(Supplementary Fig. 8), some models show an almost linear corres- 
pondence between the two quantities, potentially consistent with the 
hypothesis that the loss of sea ice removes an important amplifier of 
interannual temperature variability. However, these figures do not
demonstrate a causal link, which would require additional climate
change projections excluding sea-ice feedbacks.

Figure 4 | Projected changes in temperature variability against changes in
sea-ice cover. Thirty-one-year moving averages of CMIP5 estimates of global,
annual sea-ice cover, plotted against the 31-yr moving averages of global
temperature standard deviation. Non-black colours and symbols represent the
various models, shown every decade after 2000 (for individual model values, see
Supplementary Figs 7 and 8). For comparison, each model’s predictions of
standard deviation and sea ice are scaled to unity for the year 2000 (arrow). The
dashed black line is the average across scaled models, covering change in sea ice
over the range that most models span. The straight black line is a linear fit to the
dashed curve.
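The scaling and straight-line fit described for Fig. 4 can be sketched as follows. This is an illustrative outline only: the function names are hypothetical, the input values are invented, and an ordinary least-squares fit is assumed because the text does not say which fitting method was used.

```python
def scale_to_unity(series, years, ref_year=2000):
    """Divide a series by its value in ref_year, so that year maps to 1."""
    ref = series[years.index(ref_year)]
    return [v / ref for v in series]

def linear_fit(x, y):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

# Invented decadal values for one model: sea-ice cover (arbitrary units)
# and global temperature standard deviation (K).
years = [2000, 2010, 2020]
sea_ice = [10.0, 8.0, 6.0]
temp_sd = [0.60, 0.54, 0.48]

ice_scaled = scale_to_unity(sea_ice, years)
sd_scaled = scale_to_unity(temp_sd, years)
slope, intercept = linear_fit(ice_scaled, sd_scaled)
print(slope, intercept)
```

A positive slope in this arrangement means that the scaled standard deviation falls as sea-ice cover falls, which is the relationship the figure is used to display; as the text notes, such a fit does not by itself show causation.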

We identify major changes in temperature fluctuations in ECMWF 
gridded data and CRU data from individual stations. Significant and 
coherent spatial shifts are detected in regions of differing temperature 
variability, but we find much less evidence of changes when the data are 
averaged globally. Increases in variability are especially evident during
the 1980s and 1990s in the densely monitored regions of Europe and 
North America. However, even in these regions there is observational 
evidence that interannual temperature variability is now decreasing. 
Climate models strongly suggest that this might be the beginning of a 
longer-term decline in year-to-year temperature variability in response 
to increases in atmospheric greenhouse gas concentrations. 

Changes in climate variability are arguably more important for 
society than changes in mean climate such as overall warming, especially
if they relate to altered extremes. On the basis of the results
presented here, we believe that greater emphasis now needs to be 
placed on analysing changes in climate variability in the context of 
anthropogenic climate change, so as to inform more effective adapta- 
tion strategies. 


METHODS SUMMARY 


Our data are based on yearly or seasonal averages of atmospheric temperature 
values 2 m above the surface, and from the ECMWF ERA-40 and ERA-Interim
products. These are six-hourly, globally complete data sets, and in combination 
cover the period from 1958 to the present. They constitute a re-analysis product, 
derived by assimilating meteorological measurements into a weather forecast 
model to retrospectively provide high-frequency, gridded estimates of evolving 
weather states. 

For future climate projections, we use models from the CMIP5 family of climate 
simulations. Of these models, 17 are available driven first with historical forcings
(all forcings) and then RCP8.5 atmospheric gas concentration changes.

To calculate local, zonal or global average standard deviations (Figs 1c-e, 2, 3 
and 4), for each grid box, anomalies are first calculated by detrending local tem- 
perature values with a running mean (calculated over 11 yr except in Figs 3g and 4, 
which use 31-yr running means). Zonal or global spatial standard deviations of 
these anomalies are calculated with area weighting for each grid box. ERA-40 and 
ERA-Interim cover the years 1958 to 2011 inclusive, so with local, 11-yr detrending,
global average standard deviation values (see, for example, Fig. 1c) will correspond
to the remaining years of 1963-2006. When 11-yr moving averages of these
time-evolving zonal or global standard deviations of the de-trended anomalies are 
calculated (see, for example, Fig. 2 or the thick red curve in Fig. 3a), this corre- 
sponds to a remaining period of 1968-2001. Similarly, when climate-model local 
anomalies are derived from 31-yr detrending and then 31-yr moving averages are 
calculated on the basis of the globally averaged standard deviation of these anom- 
alies, this gives a time series for the years 1890-2070 for simulations covering 
1860-2100 (Fig. 3g). 
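The year ranges quoted above follow from simple window arithmetic, which the sketch below reproduces. It assumes centred windows of odd length, so that each pass trims half a window from both ends of the record; `centred_span` is a hypothetical helper name.

```python
def centred_span(first_year, last_year, window):
    """Years that remain after applying a centred window of odd length."""
    if window % 2 == 0:
        raise ValueError("window must be odd")
    half = window // 2
    return first_year + half, last_year - half

# ERA-40 plus ERA-Interim, 1958-2011: 11-yr detrending, then 11-yr smoothing.
detrended = centred_span(1958, 2011, 11)
smoothed = centred_span(*detrended, 11)

# Climate-model runs, 1860-2100: 31-yr detrending, then 31-yr smoothing.
model = centred_span(*centred_span(1860, 2100, 31), 31)

print(detrended, smoothed, model)  # (1963, 2006) (1968, 2001) (1890, 2070)
```

The same arithmetic gives 1963-1996 for ERA-40 alone (1958-2001) and 1984-2006 for ERA-Interim alone (1979-2011) after a single 11-yr detrending pass.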


Full Methods and any associated references are available in the online version of 
the paper.

METHODS


The ECMWF ERA-40 gridded data sets are globally complete and have full years
of 6-hourly temperature data between 1958 and 2001 inclusive. ECMWF ERA-Interim
similarly has 6-hourly data, and at present covers the period from 1979
onwards. As time progressed, the ERA-40 analysis entrained more measurements,
most notably satellite retrievals after 1979 (ref. 11). Comparison with the weather-station-based
CRUTEM4 data set, however, reveals broad agreement, especially
concerning year-to-year variability through that transition period. The
detrending of data (as we have done, predominantly with a time period of 11 yr)
will remove known offsets, which are more present for the Southern Hemisphere
and especially the Antarctic.

In Fig. 1a, we calculate at each ECMWF grid box the yearly average temperature.
Anomalies are then found by initially subtracting from these the average of the 
yearly temperatures between 1958 and 1980, before then dividing these values by 
the local standard deviations of these numbers for 1958-1980. For each of the four 
time periods, we then put all of these normalized yearly anomalies, from across the 
globe, into bins of size 0.05. To allow area weighting (that is, smaller grid boxes 
near the poles), bin values are multiplied, for each anomaly value falling in them, 
by cosine (in degrees) of latitude. The distributions plotted are then scaled to have 
an area of unity beneath them, thus allowing comparison. Figure 1b is identical to 
Fig. 1a, except that in Fig. 1b the initial calculation of anomalies for each curve is 
made relative to the mean of their own time period. We then divide by the local 
standard deviations of these new anomalies, again for the period 1958-1980. 
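The binning step for Fig. 1a can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' code: it takes already-normalized anomalies with their latitudes, adds a cosine-of-latitude weight to the bin each value falls in, and rescales so that the area under the histogram is one. The function name and sample values are hypothetical.

```python
import math

def weighted_pdf(values, latitudes_deg, bin_width=0.05):
    """Area-weighted histogram of normalized anomalies, scaled to unit area.

    Returns a list of (left_bin_edge, density) pairs for non-empty bins.
    """
    weights = {}
    for value, lat in zip(values, latitudes_deg):
        index = math.floor(value / bin_width)
        # Latitude is given in degrees, so convert before taking the cosine.
        weights[index] = weights.get(index, 0.0) + math.cos(math.radians(lat))
    total = sum(weights.values())
    return [(index * bin_width, w / (total * bin_width))
            for index, w in sorted(weights.items())]

# Three invented normalized anomalies: two near zero (one at 60 degrees
# latitude, which carries half the weight of an equatorial box) and one near 1.
pdf = weighted_pdf([0.01, 0.02, 1.03], [0.0, 60.0, 0.0])
area = sum(density * 0.05 for _, density in pdf)
print(pdf, area)
```

Scaling to unit area is what allows curves for periods of different length to be compared directly.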

In Fig. 1c, time-evolving global standard deviations of anomalies are analysed.
Here, to remove the longer-term warming signals, anomalies of the ERA-40 data 
are first created by subtracting from the local, yearly average values their 11-yr 
running mean. Presented (black curve) for these anomalies and each year of the 
mid-points of this 11-yr detrending, giving the period 1963-1996, is their area- 
weighted standard deviation across the globe. We extend these calculations (cyan 
curve) for the ECMWF ERA-Interim data, which, again with 11-yr detrending, 
gives a time series for the years 1984-2006 inclusive. Our ERA-40 detrended 
anomalies are then normalized, at each geographical point, by division by the 
standard deviation of those numbers for the period 1963-1980. The red curve is 
the time-evolving, global standard deviation, but now of these normalized anom- 
alies. Figure 1d, e considers grid-box standard deviations before and after 1980, 
again using the (non-normalized) temperature anomalies relative to 11-yr running 
means. Standard deviation is calculated as a pair of values for each grid box, and for 
two periods, 1963-1980 and 1981-1996. In Fig. 1d, we bin, by standard deviation 
value calculated for the 1963-1980 anomalies, the number of occurrences (area 
weighted by multiplying by cosine of latitude for each occurrence) of an increase or 
a decrease in standard deviation in the subsequent period, 1981-1996. Figure 1e is
a map of percentage change in standard deviation between these times, again using 
the same pairs of values calculated at each grid box. Seasonal versions of Fig. 1a, b 
and e are in Supplementary Information. Furthermore, Supplementary Fig. 3a 
gives zonal differences in these standard deviations, thus again comparing the
period after 1980 with the period before (plotted at ECMWF latitudinal resolution),
and Supplementary Fig. 3b gives zonal temperature changes.
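A minimal sketch of the two core steps described for Fig. 1c: local detrending with an 11-yr centred running mean, then an area-weighted standard deviation across grid boxes for one year. It assumes cosine-of-latitude weights and assumes that the weighted spatial mean is subtracted before the variance is taken; `detrend` and `global_sd` are hypothetical names and the inputs are invented.

```python
import math

def detrend(series, window=11):
    """Anomalies relative to a centred running mean; drops window//2 years per end."""
    half = window // 2
    return [series[i] - sum(series[i - half:i + half + 1]) / window
            for i in range(half, len(series) - half)]

def global_sd(anomalies, latitudes_deg):
    """Area-weighted standard deviation across grid boxes for a single year."""
    weights = [math.cos(math.radians(lat)) for lat in latitudes_deg]
    total = sum(weights)
    mean = sum(w * a for w, a in zip(weights, anomalies)) / total
    variance = sum(w * (a - mean) ** 2 for w, a in zip(weights, anomalies)) / total
    return math.sqrt(variance)

# A purely linear warming trend leaves no anomaly after detrending.
trend_only = [0.1 * year for year in range(15)]
residual = detrend(trend_only)

# Two equatorial boxes with anomalies of +1 K and -1 K.
example_sd = global_sd([1.0, -1.0], [0.0, 0.0])
print(len(residual), example_sd)
```

Because the running mean is centred, a 15-yr input yields only 5 detrended years, which is why the ERA-40 record of 1958-2001 reduces to 1963-1996.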

Figure 2 is a plot of time-evolving, latitudinal standard deviation for ERA-40 
and ERA-Interim. In each grid box, standard deviations of anomalies are calcu- 
lated over (moving) 11-yr periods. This, in combination with the initial 11-yr 
detrending to create the anomalies, gives calculations for the years 1968-1991 
inclusive when using ERA-40 and calculations for the years 1989-2001 inclusive 
when using ERA-Interim. For each of these years, zonal mean standard deviations 
are then derived for 10° bands and expressed as percentage changes compared to 
their average value before 1980. 

Figure 3a shows four time-evolving, global standard deviations of individual 
ensemble members of the HadGEM2-ES model, and for historical-plus-RCP8.5 
forcings. These are calculated for each year across all model grid boxes (and are 
area weighted) and again from anomalies that have been derived by local detrend- 
ing with 11-yr running means. The thick black curve is the mean of the 11-yr 
moving averages of these four historical-RCP8.5 ensembles. The thick red curve is 
the 11-yr moving average of the global standard deviation values for ERA-40 and 
ERA-Interim (and, hence, a smoothed version of the black and cyan curves in 
Fig. 1c). Figure 3b-e shows zonal averages of 11-yr moving averages of standard
deviations from the detrended data, which are hence similar calculations to those 
of Fig. 2 for the ECMWF data (same colour scheme as Fig. 2, and the change is 
expressed as a percentage of the mean standard deviation before 1980). For each of 
the 17 climate models available with simulations made using scenarios of histor- 
ical-plus-RCP8.5 forcings, and again averaging across ensembles, curves identical 
to the black one in Fig. 3a are derived and shown in Supplementary Fig. 6. The 
mean of all of these curves is the black curve in Fig. 3f, which provides an overall 
time-evolving estimate of global standard deviation averaged across climate mod- 
els (where, as outlined above, each climate model is averaged across available 
ensembles of its estimate of 11-yr moving averages of global standard deviation, 
themselves based on anomalies from 11-yr local detrending). Additionally each 
model-specific curve in Supplementary Fig. 6 is then scaled such that its mean up 
to 1980 is the same as the all-model mean. The spread of these curves (time- 
evolving maximum and minimum global standard deviations across models) is 
the turquoise area. Figure 3g and Supplementary Fig. 7 are identical to Fig. 3f and 
Supplementary Fig. 6, respectively, except that local anomalies are instead first 
detrended with 31-yr running means. We show the moving averages of global 
standard deviations of these anomalies, with the averaging also now at the 31-yr 
timescale. All calculations for sea ice (used in Fig. 4 and Supplementary Fig. 8) are 
based on 31-yr moving averages of annual average areal coverage. 
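The cross-model scaling used for the turquoise spread in Fig. 3f can be sketched as follows. The sketch assumes a multiplicative rescaling and treats "up to 1980" as inclusive; neither detail is stated explicitly in the text. The function name and the two short curves are hypothetical.

```python
def scale_to_common_baseline(curves, years, baseline_end=1980):
    """Rescale each model curve so its mean up to baseline_end equals the
    all-model mean over the same years, then take the min/max envelope."""
    base_idx = [i for i, year in enumerate(years) if year <= baseline_end]
    n_years = len(years)
    mean_curve = [sum(c[i] for c in curves) / len(curves) for i in range(n_years)]
    target = sum(mean_curve[i] for i in base_idx) / len(base_idx)
    scaled = []
    for curve in curves:
        own_base = sum(curve[i] for i in base_idx) / len(base_idx)
        scaled.append([v * target / own_base for v in curve])
    lower = [min(s[i] for s in scaled) for i in range(n_years)]
    upper = [max(s[i] for s in scaled) for i in range(n_years)]
    return mean_curve, scaled, lower, upper

# Two invented model curves of global standard deviation (K).
years = [1970, 1980, 1990, 2000]
curves = [[1.0, 1.0, 0.8, 0.6], [2.0, 2.0, 1.8, 1.0]]
mean_curve, scaled, lower, upper = scale_to_common_baseline(curves, years)
print(lower, upper)
```

After rescaling, models with very different absolute variability share a common baseline, so the envelope reflects relative change rather than differences in each model's mean level.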
The role of behaviour in adaptive morphological
evolution of African proboscideans


Adrian M. Lister


The fossil record richly illustrates the origin of morphological adapta- 
tion through time. However, our understanding of the selective forces 
responsible in a given case, and the role of behaviour in the process, 
is hindered by assumptions of synchrony between environmental 
change, behavioural innovation and morphological response. Here 
I show, from independent proxy data through a 20-million-year 
sequence of fossil proboscideans in East Africa, that changes in 
environment, diet and morphology are often significantly offset 
chronologically, allowing dissection of the roles of behaviour and 
different selective drivers. These findings point the way to hypo- 
thesis-driven testing of the interplay between habitat change, beha- 
viour and morphological adaptation with the use of independent 
proxies in the fossil record.

Elephants (Elephantidae) arose in Africa from within a paraphyletic 
assemblage of gomphothere proboscideans (Fig. 1) with low-crowned 
molars comprising a series of rounded cusps. The fusion of these cusps 
into transverse ‘lophs’ led to the lamellar structure of the elephant 
molar which then, in parallel in different genera, multiplied the number
of lamellae and increased the height of the crown (hypsodonty).
These changes occurred in Africa against a backdrop of Neogene
aridification and replacement of C3-dominated woodland with C4-dominated
grassland.

Palaeosol δ¹³C data show that the spread of C4 vegetation in East
Africa occurred gradually, beginning around 10 million years (Myr)
ago and accelerating after 4 Myr (Fig. 2a). Enamel δ¹³C data (Fig. 2b)
indicate that late gomphotheres and early elephants both switched 
rapidly from a C3-dominated to a C4-dominated feeding strategy 
around 8 Myr ago, consuming a substantial proportion of grass at a 
time when the vegetation was a mosaic still containing significant 
areas of C3 woodland. This behavioural switch to grazing-dominated
mixed feeding occurred in parallel in several proboscidean genera and
persisted little-changed through their subsequent history (Fig. 2b). It is
interesting, however, that in the derived gomphothere Anancus and
stegodont Stegodon, significant individual or population variation in
feeding strategy is seen in their spread of enamel δ¹³C values, exemplifying
the behavioural ‘choice’ that probably originated the switch
to grazing at the species level. One group of proboscideans, the deinotheres,
remained strict C3-feeders (very probably browsers) within
remnant woodland habitat through this period (Supplementary Fig. 1),
highlighting the availability of a range of habitats; deinotheres retained
a simple, low-crowned dentition throughout.

The interpretation of C3 or C4 feeding in African Neogene ungu- 
lates as largely indicating browse or graze, respectively, is supported by 
cross-taxon studies, including dental mesowear analysis, as well as 
vegetation data. Equids, in the same East African sites as the probos- 
cideans studied here, show browsing mesowear associated with a C3 
enamel δ¹³C signal at 9.9 Myr ago, and grazing mesowear associated
with a C4 signal at 7.4 Myr ago. At Laetoli (Tanzania, ~4.4-2.6 Myr
ago), despite phytolith evidence for C3 as well as C4 grasses, the combination
of dental δ¹³C and mesowear analysis indicates that the C3
component of a range of mixed-feeding mammal species was largely
browse. Nonetheless, it cannot be ruled out that some C3-feeding
proboscideans before 8 Myr ago may have been taking a proportion
of C3 grass (like modern elephants that favour bamboo, a C3 grass).

The Late Miocene transition from C3 browsing to C4 grazing is seen
in other ungulate groups in Africa and elsewhere, and has been
linked to an increase in the crown height (hypsodonty) of the molars.
Elephants underwent a threefold increase in crown height (Fig. 2d) as
well as substantial multiplication of the number of enamel lamellae
(Fig. 2c), both considered adaptations to increased dietary wear, and in
the case of lamellar number, possibly also to the mechanics of chewing
grass. Strikingly, most of the increase in hypsodonty and lamellar
number occurred after 5 Myr ago, a lag of at least 3 Myr after the
adoption of a grass-dominated diet (Fig. 2). Small increases in lamellar
number are seen at around 10 Myr ago and, together with modest
hypsodonty increase, at 8-6 Myr ago, coincident with shifts in δ¹³C
both in pedogenic carbonate and tooth enamel (dietary signal); we lack
the resolution to assess whether these anatomical shifts trailed the
behavioural shift. However, the limited nature of this dental response
to grazing among the bunodont gomphotheres and brachyodont early
elephants and stegodonts before 5 Myr ago (compare Fig. 2b and 2d) is
surprising and remarkable, as these are morphologies generally associated
with soft-leaf browsing.

Such offsets between behaviour and morphology demand that we
think beyond the traditional palaeontological assumption of a lock-step
between the two. One possibility is that behavioural accommodation
preceded morphological adaptation by placing the organism in a new
niche where selection then led to morphological adaptation. The behavioural
switch to C4 grazing at around 8 Myr ago in late gomphotheres
and early elephants would have led to selection for enhanced dental
resistance to abrasion. Grass contains more biogenic silica (phytoliths)
than browse, and grazing (generally low-feeding) ungulates also pick
up more soil-derived grit than tree- and shrub-feeders. Even if some
proboscideans were taking a proportion of C3 grass before this date, this
might not have produced strong selection on dental morphology
because of the lower phytolith concentration than in C4 grasses and
the semi-wooded habitat with limited pedogenic and airborne dust (see
below). Early responses to grazing pressure are seen in the minor dental
advances before 5 Myr ago. But the major adaptation, a substantial
increase in hypsodonty, required significant reorganization of cranial
anatomy to accommodate high-crowned teeth, which could account,
in part, for the marked delay in its appearance.

Figure 1 | Phylogeny of proboscidean genera included in this study. Bold
lines indicate the approximate duration of existence of each genus in East
Africa; Anancus and Palaeoloxodon endured longer elsewhere in the continent
(Supplementary Tables 1-3 and ref. 6). Dashed lines indicate relationships, but
position of nodes does not convey exact date of divergence. Stegodon is
considered the sister-group of Elephantidae by some authors; others subsume
Palaeoloxodon within Elephas (ref. 6 and Supplementary Information).

[Figure 2a vegetation categories: forest, woodland, wooded grassland, grassland.]

Figure 2 | Environmental, behavioural and morphological trends for East
African proboscideans over the past 20 Myr. a, The environmental proxy:
index of vegetation from palaeosol pedogenic carbonate δ¹³C. Data are from
selected localities in Kenya, Tanzania (Olduvai), and Ethiopia (Omo Valley).
Turquoise, Lothidok; purple, Fort Ternan; brown, Baringo; yellow, Lothagam;
pink, Kanapoi; green, Koobi Fora; red, Nachukui; orange, Shungura; maroon,
Kanjera; blue, Olorgesailie; grey, Olduvai; see Supplementary Table 1 for details
and sources. Vegetational interpretation from ref. 4. b, The behavioural
(dietary) proxy: δ¹³C from proboscidean dental enamel. Broad dietary
classification from ref. 3. c, d, The adaptive morphology: number of transverse
enamel units (cusp pairs, lophs or lamellae) in upper and lower M3
(c), hypsodonty index in upper M3 (d). Key to b-d: triangles, stegodonts and
gomphotheres: pink, Archaeobelodon; orange, Gomphotherium; blue,
Afrochoerodon; red, Choerolophodon; purple, Tetralophodon; brown, Anancus
kenyensis; pale brown, A. ultimus; green, Stegodon; pale green, S. kaisensis
‘Warwire Stage’; open, indeterminate gomphothere. Circles, elephants: orange,
Stegotetrabelodon; purple, Primelephas; blue, Loxodonta; yellow, Elephas;
red, Palaeoloxodon; open, indeterminate elephant. Open diamonds:
tetralophodont/elephantid intermediate. Horizontal green lines mark interval
8-5 Myr ago when diet changed (b) but with little morphological response
(c, d). See Supplementary Tables 2 and 3 for details of species, localities and
dating. Supplementary Fig. 2 provides an enlarged version of this figure.

The concept of an evolutionary ‘lag’ due to behavioural initiation has
sometimes been considered implausible because of the implied period
of ‘maladaptation’. If the gomphotheres, stegodonts and elephants in
the 8-5 Myr ago interval were suffering enhanced tooth wear as a result 
of grazing, their survival could have been ensured by countervailing 
competitive advantages of exploiting a new niche, but with life-history 
adjustment to reduced longevity (for example, earlier reproduction) as 
their teeth wore out sooner. Increased tooth-life through hypsodonty 
and lamellar increase, reversing this trend, would then have been at 
a selective premium. Studies of several lineages of North American 
mammals also indicate significant time lag between the appearance 
of open habitats and evolution of hypsodonty*». For example, in a 
study of horses (Equidae) with dental mesowear as dietary proxy, a 
gradual transition to grass-eating began around 22 Myr ago, but major 
hypsodonty increase did not begin until about 14 Myr ago; popula- 
tions in the intervening period are considered to have been pioneering 
new habitats and consequently under elevated selection pressure’*"®. 
Another morphological feature in proboscideans potentially explain- 
ing their adaptation to graze after 8 Myr ago, is their peculiar system of 
horizontal dental replacement. However, this had already largely been 
achieved before that time, providing a large lifetime dental battery for 
feeding very large body size, within a short jaw biomechanically linked 
to the heavy head and tusks”. More significantly, the advent around 
7 Myr ago of a block-like molar crown of parallel lamellae held together 
by cement was the key dental adaptation of crown group Elephantidae 
that subsequently allowed increased hypsodonty and lamellar mul- 
tiplication, explaining their ultimate success at the expense of the 
gomphotheres and stegodonts. 

A second factor potentially explaining the ‘lag’ between the onset of 
grass-eating and the morphological response is a decoupling of the 
selective factors associated with grazing. Whereas the switch to grass-eating 
is attested by the strong C4 signal within enamel δ¹³C after 10-8 Myr ago, 
the quantity of soil-derived grit might have increased later. 
We do not yet have good proxy data for environmental grit, or its 
consumption by fossil mammals, although dental microwear is a 
potential tool’’. However, the remarkable parallel, from 3.5 Myr ago 
onwards, between the increasingly open environments indicated by 
palaeobotanical evidence and palaeosol δ¹³C*"* (Fig. 2a), and the 
Figure 3 | Incremental evolution of hypsodonty index in the African 
Plio-Pleistocene Palaeoloxodon recki/iolensis lineage. Hypsodonty index 
from data in Fig. 2d (circles) plus P. iolensis from North Africa (squares) and 
southern Africa (diamonds). 


major trend in elephant hypsodonty and lamellar number (Fig. 2c, d), 
could be due to additional selective pressure imposed by increasing 
quantities of dust and grit, both airborne and from soil at the place of 
feeding. This in turn was driven by the increased aridity after 3.5 Myr 
ago’, a factor known to be strongly correlated with airborne partic- 
ulate matter’’, combined with the absence of local tree cover acting 
as sediment traps*’. The pressure would have been enhanced by the 
shorter grass species typical of more arid grassland, leading to increased 
soil uptake by grazing ungulates’, and by competition from the 
expanding guild of grazing species”. 

These factors imply an evolutionary ‘ratchet effect’ whereby the 
initial behavioural commitment to grass-eating at 10-8 Myr ago not 
only placed species in a niche with selection for morphological adapta- 
tion, but where later environmental changes increased the selective 
pressure. The resulting trend in hypsodonty and lamellar number 
in the Palaeoloxodon recki/iolensis lineage (= Elephas recki/iolensis 
of some authors), in particular, represents a remarkable incremental 
transformation*”’ (Fig. 3). Maximal values of lamellar number and 
hypsodonty are not reached until 0.5 Myr ago, at least 7 Myr after the 
onset of a grazing-dominated diet. Parallel shifts in morphology, with 
very similar chronology, are seen in two other elephant lineages origin- 
ating in Africa, Elephas** and Mammuthus”, in the different envir- 
onmental settings of southern and northern Eurasia, respectively. This 
might reflect similar environmental pressures, or a common limit to the 
rate of adaptation following an initial shift to grazing. A further hypo- 
thesis for the protracted evolution of hypsodonty, an evolutionary arms 
race between plant phytoliths and mammalian dental adaptation, is not 
supported by phytolith data through the history of C4 grasses”®. 

In contrast to the evolutionary trends in elephantids, the two late 
elephantoid lineages Stegodon kaisensis and Anancus kenyensis/ultimus, 
having survived a behavioural shift to grazing, lacked the key dental 
innovations allowing hypsodonty, so apparently succumbed to further 
environmental change. Both show the addition of an enamel cusp pair 
or loph (Fig. 2c) and complexification of the enamel band around 
4 Myr ago®”*®, a ‘last throw’ to achieve increased resistance to abrasion 
in molars, but evidently insufficient as they went regionally extinct 
soon after®. 

Finally, this study sheds light on the origin of the living African 
forest and savannah elephants Loxodonta cyclotis and Loxodonta afri- 
cana (L. a. africana and L. a. cyclotis of some authors). Their similar 
hypsodonty index (typically in the range 1.7-2.1) and lamellar count 
(mode 10-12) (see Methods) indicates a semi-open, mixed-feeding 
Figure 4 | Hypsodonty evolution in African Pliocene to Recent Loxodonta. 
Hypsodonty data from Fig. 2d plus southern and North African L. atlantica 
and Recent L. africana and L. cyclotis. Squares, fossil samples: orange, 
L. cookei; green, L. adaurora; purple, L. exoptata; white, L. sp.; grey, 
L. atlantica zulu; black, L. atlantica atlantica. Blue triangle, L. africana; 
red circle, L. cyclotis. 


ancestry for L. cyclotis despite its current forest habitat and browsing 
diet. Fossil evidence shows that whereas modern lamellar count was 
attained in the lineage by about 3.5 Myr ago, modern hypsodonty 
index was not reached until after 2 Myr ago (Figs 2c, d and 4), so 
Loxodonta may have entered Central African forests around this time, 
consonant with its dietary flexibility shown by a wide range of δ¹³C 
values (Fig. 2b), and the lack of its remains on the savannahs after this 
date, as Palaeoloxodon recki became ecologically dominant’’. These 
data also indicate that the divergence date between L. africana and 
L. cyclotis was at, or somewhat more recent than, the younger end of 
the range 2.6-5.6 Myr ago estimated from nuclear DNA”. Whether 
the population ancestral to L. africana remained in the savannah 
throughout, or re-emerged from the forest*’, remains to be seen. 


METHODS SUMMARY 


Palaeosol carbonate and dental enamel δ¹³C values were collated from the literature 
and are expressed as ‰ relative to standard. Because of similar fractionation 
relative to the living plant (~14‰ for enamel’, 12-17‰ for pedogenic carbonate), 
they occupy similar but not directly comparable ranges. Interpretation of δ¹³C 
ranges as vegetation types is based on modern East African data from ref. 4, and as 
dietary categories from ref. 3. Only East African proboscidean genera for which 
δ¹³C enamel data are available are included in the morphometric analysis. Taxa are 
plotted as genera; species designations and taxonomic revisions are given in 
Methods and Supplementary Tables 2 and 3. Dental parameters are calculated 
from literature data and original measurements. Hypsodonty index is 100 × maximum 
unworn height of upper M3 crown divided by maximum crown width 
including cement. Transverse cusp pairs in gomphotheres, lophs in Stegodon 
and early elephants, and lamellae (plates) in later elephants are considered homo- 
logous; their number excludes ‘talons’ and is pooled for upper and lower M3. 
Sample ages are based mainly on radiometrically dated volcanic tuffs (Supplemen- 
tary Tables 1-3). 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Palaeosol and enamel stable isotopes. Palaeosol carbonate and dental enamel 
δ¹³C values were collated from the literature (Supplementary Tables 1 and 2 and 
references in Supplementary Information), and are expressed as ‰ relative to PDB 
(Pee Dee Belemnite) standard. Because of similar fractionation relative to the 
living plant (~14‰ for enamel’, 12-17‰ for pedogenic carbonate’), they occupy 
similar but not directly comparable ranges. Interpretation of δ¹³C ranges as 
vegetation types is based on modern East African data from ref. 4, and as dietary 
categories from ref. 3. 
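
As an illustration of the notation used above, the following minimal sketch shows how a δ¹³C value in ‰ follows from an isotope ratio, and how the approximate 14‰ enamel offset could be removed to estimate the δ¹³C of the diet. The reference ratio `R_PDB` is a commonly quoted figure and is an assumption here, not a value taken from this paper; the function names are hypothetical.

```python
# Assumed 13C/12C ratio of the PDB standard (commonly quoted; not from this paper).
R_PDB = 0.0112372


def delta13c(r_sample: float, r_standard: float = R_PDB) -> float:
    """Return delta-13C in per mil relative to the chosen standard."""
    return (r_sample / r_standard - 1.0) * 1000.0


def diet_from_enamel(delta_enamel: float, enrichment: float = 14.0) -> float:
    """Estimate plant (diet) delta-13C by subtracting the enamel enrichment.

    The default of 14 per mil is the approximate enamel fractionation
    quoted in the text; treat it as a rough offset, not a calibration.
    """
    return delta_enamel - enrichment


# A sample depleted by 1% in 13C/12C relative to the standard is about -10 per mil.
example = delta13c(R_PDB * 0.99)
```

Because pedogenic carbonate carries a different offset (12-17‰ in the text), enamel and palaeosol values would need separate corrections before any direct comparison.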

Palaeosol carbonate δ¹³C values in Fig. 2a are from the Turkana basin and other 
localities in Kenya and Tanzania (Supplementary Table 1). Large numbers of 
determinations are available in the published sources*'; where appropriate, 
multiple determinations from the same sample or in some cases same horizon have 
been averaged. Fig. 2a shows considerable inter-site variation in the δ¹³C signal, 
and hence implied vegetation, even between regionally close localities (exemplified 
by those around Lake Turkana). However, it is notable that despite the offset in 
absolute values between localities, the trend to reduced δ¹³C after 4 Myr ago is 
seen in each Formation (Koobi Fora, East Turkana; Nachukui, West Turkana; 
Shungura, Omo Valley). Additional determinations have been published for some 
of these localities and corroborate the pattern in Fig. 2a. Examples include Bed I at 
Olduvai (~1.8 Myr ago), with δ¹³C values in the region −4 to −6‰ (ref. 32), and 
additional data from Koobi Fora in the interval 2.0-1.5 Myr ago corroborating a 
δ¹³C trend from about −7 to about −5‰ (ref. 33). Further afield, a large set of 
palaeosol carbonate δ¹³C values from the Awash valley, Ethiopia shows some 
differences from the Omo-Turkana Basin, but with the same broad trend of 
opening habitat over the past 4 Myr**4, indeed with more positive δ¹³C values 
than Turkana at some points in the sequence. Deposits in the region of 19-18 Myr 
ago at Rusinga Island (Kenya) have pedogenic carbonate δ¹³C values clustered 
around −8 to −10‰, indicating woodland as expected for this time interval 
but more positive than at Fort Ternan (~14 Myr ago), interpreted as reflecting 
a C3 vegetation with reduced photosynthetic fractionation due to a semi-arid 
episode evidenced by sedimentology”’. Another series of early samples with 
relatively positive δ¹³C values (about −5.2 to −7.4‰) came from the Muruyur 
Formation of the Tugen Hills, and was interpreted as reflecting a mixed C3/C4 
vegetation as early as 15 Myr ago**. However, these results have been disputed in 
terms of the origin and age of the analysed carbonates and other palaeobotanical 
and faunal evidence indicating forest in the region at the time*”**. Somewhat later, 
the wide range of values at Lothagam, South Turkana (Fig. 2a) implies a local 
expansion of relatively open (wooded grassland) habitat, earlier than in other 
areas”. 

Despite the general trend towards increasingly open environments through the 
Pliocene, palaeobotanical and faunal evidence indicates that African environ- 
ments retained a mosaic character throughout the late Cenozoic, with some locally 
wooded or even forested areas even after the spread of grassland*’. The 
Proboscidean family Deinotheriidae retained a strongly browsing δ¹³C enamel 
signature throughout the succession (Supplementary Fig. 1), implying persistence 
within a woodland habitat. This is seen both in Prodeinotherium in the interval 
18-12 Myr ago, with δ¹³C values similar to those of contemporary gomphotheres, and 
its likely descendent Deinotherium in the interval 5-1 Myr ago, when contempor- 
ary gomphotheres and elephants, often preserved at the same localities (Sup- 
plementary Table 2), show strongly mixed-feeding to grazing diets. 
Morphometrics, taxonomy and identification. Only East African proboscidean 
species for which δ¹³C enamel data are available are included in the morphometric 
analysis. Dental parameters are calculated from literature data and original 
measurements. Hypsodonty index is 100 × maximum unworn height of upper M3 
crown divided by maximum crown width including cement’. Transverse cusp 
pairs in gomphotheres, lophs in Stegodon and early elephants, and lamellae 
(plates) in later elephants are considered homologous; their number excludes 
‘talons’ and is pooled for upper and lower M3. Sample ages are based mainly on 
radiometrically-dated volcanic tuffs (Supplementary Table 3). 
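
The hypsodonty index defined above is a simple ratio, and a minimal sketch of the calculation may help when comparing values across the text. The function name and the example measurements are hypothetical and are not taken from the Supplementary Tables.

```python
def hypsodonty_index(crown_height_mm: float, crown_width_mm: float) -> float:
    """Return 100 x unworn crown height / maximum crown width (including cement).

    Both measurements refer to the upper M3 and must be in the same unit.
    """
    if crown_height_mm <= 0 or crown_width_mm <= 0:
        raise ValueError("crown height and width must be positive")
    return 100.0 * crown_height_mm / crown_width_mm


# Hypothetical molar: 150 mm unworn crown height, 80 mm maximum width.
hi_example = hypsodonty_index(150.0, 80.0)  # 187.5 on the x100 scale
```

Values quoted elsewhere in the text as 1.7-2.1 presumably express the same ratio without the factor of 100, so 187.5 on this scale would correspond to about 1.9.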

Proboscidean taxa are plotted by genus in Fig. 2, but individual species (and in 
some cases subspecies) designations can be found in Supplementary Tables 2 and 
3. For most genera, only a single species occurs in East Africa in the time period 
considered. Exceptions are included in the following alphabetical list of taxonomic 
notes and revisions: 

Afrochoerodon. The trilophodont gomphothere A. kisumuensis was described 
by MacInnes"’ as Trilophodon angustidens kisumuensis. It was recognized as a 
choerolophodont by Tassy** who named it Choerolophodon kisumuensis, and 
placed in Afrochoerodon by Pickford’. 

Anancus. Most anancines are allocated to A. kenyensis (MacInnes, 1942) except 
for some advanced specimens recently allocated to Anancus ultimus Sanders, 2011 
(ref. 28). These are marked separately in Fig. 2c. 
Anancus/Choerolophodon. A specimen from Nakali, Kenya, dated to 10.0-8.8 Myr, 
has been listed and discussed as Anancus’*. However, this is considerably 
earlier than the otherwise known entry of Anancus into Africa®“’, and in a revised 
faunal list for the site only Deinotherium, Choerolophodon and indeterminate 
Elephantidae have been identified*. Y. Kunimatsu (personal communication, 
2011) indicates that the specimen is a small fragment, not fully identifiable but 
by analogy with other Nakali fauna probably Choerolophodon. 

Gomphotheriidae indet. Specimens plotted as ‘Gomphotheriidae indet.’ in 
Fig. 2b are fragments of enamel, mainly from Lothagam (Lower Nawata) and 
Namurungule. For the Lower Nawata, Anancus kenyensis (tetra/pentalophodont) 
is the only gomphothere identified**, whereas for Namurungule, only Tetralo- 
phodon and Choerolophodon have been reported*’. These are therefore likely 
identifications but other taxa cannot be excluded. 

Mammuthus. Mammoths are not included in the present study since, although 
they show striking evolution of hypsodonty and lamellar number through the 
Plio-Pleistocene, most of this evolution took place in Eurasia**. There is also at 
present no enamel isotopic data on African Mammuthus. Our current limited 
knowledge of African remains referred to this genus is summarized in ref. 23. 

Palaeoloxodon is represented by P. recki, often named Elephas recki Dietrich, 
1915, but here considered congeneric with Eurasian species of Palaeoloxodon 
Matsumoto”; others prefer subspecific status as E. (P.) recki*'. This taxonomic 
issue will not be resolved without a cladistic analysis of the whole Elephas- 
Palaeoloxodon clade. A series of subspecies of P. recki has been erected*””’, and 
specimens have been listed by their allocated subspecies in Supplementary Tables 
2 and 3. Some authors**** have suggested abandoning these names because their 
morphological and temporal ranges overlap, while others”® retain them as useful 
labels for biochronological stages. There is also disagreement on whether the oldest 
member, E. r. brumpti, should be included in the lineage*'’®. Data have been 
published on some P. recki molars additional to those tabulated here™*”’, but 
these have not been included as their localities and horizons (and hence ages) 
were not individually noted’. Some NHM specimens listed in ref. 2 are from early 
20th century excavations at Kanjera, and hence of uncertain age”, or localized only 
to ‘Homa Mountain’. The species P. iolensis (Pomel, 1895), known mainly from 
North and southern Africa, is considered the terminal member of the African 
Palaeoloxodon lineage**. The term ‘incremental’ used to describe the morphological 
trajectory is intended to convey change through a series of stages; it is 
preferred to ‘gradual’ which suggests perfect continuity or infinitesimal gradations. 
Nor does the observed incremental change necessarily imply a purely anagenetic 
mode of evolution: it is possible that successive stages evolved in isolated popula- 
tions, that is, as cladogenetic events at or below the species level. 

Primelephas, Stegotetrabelodon and Elephantidae indet. Specimens plotted as 
‘Elephantidae indet.’ in Fig. 2b are fragments of enamel, mainly from Lothagam 
and the Adu-Asa Formation (Awash), likely to be either Primelephas or Stegote- 
trabelodon. P. korotorensis Coppens, 1965 includes P. gomphotheroides Maglio, 
1970 since they are likely synonyms, P. korotorensis taking priority*. Finally, molar 
LT23783 from the Upper Nawata of Lothagam is the holotype of Elephas nawa- 
taensis Tassy 2003 (ref. 46), but has been referred by other authors to Primelephas°. 

Stegodon. This genus, known mainly from Eurasia, is of uncertain systematic 
position; there is debate whether its elephant-like features are synapomorphic or 
convergent*». If convergent, Stegodon is likely to be basal to Gomphotheriidae (as 
shown in Fig. 1), and Elephantidae derived instead from advanced tetralophodont 
gomphotheres such as the specimens from Kakara and Chorora (see below). 

Tetralophodon, and Tetralophodont/Elephantid intermediates. Specimens plot- 
ted under Tetralophodon are tetrabelodont gomphotheres sensu lato. There is 
debate over the allocation of some molars in the 10-9 Myr ago interval that are 
agreed to be morphologically at the borderline between tetralophodonts and ele- 
phantids. These include a molar from the Kakara Formation of Uganda, described 
as a “perfect intermediate between a gomphothere and an elephant”, and a molar 
from Chorora (Awash), considered an advanced tetralophodont gomphothere 
close to the ancestry of elephants®. As their attribution to gomphotheres or 
elephantids is debated, they have been plotted in Fig. 2 as ‘tetralophodont/ele- 
phantid intermediates’. 

Additional minor notes on taxonomy and identification are given in Sup- 
plementary Tables 2 and 3. 

Dental evolution in Loxodonta. The living forest and savannah forms of African 
elephant are currently formally recognized as subspecies, L. africana cyclotis and 
L. africana africana, respectively®, but their distinctiveness based on nuclear DNA 
has led to suggestions that they should be recognized as separate species™. Here 
they are named as species for clarity, although dentally, apart from the smaller size 
of L. cyclotis, they are indistinguishable. The number of lamellae (excluding talons) 
in M3 ranges from 9-13 (mode 10-12) in both forms (refs 65-67 and Sup- 
plementary Table 3). Hypsodonty index in L. africana is typically in the range 
1.7-2.1 (refs 2,68 and Supplementary Table 3). There is little comparative data for 
L. cyclotis, but two molars from Congo (RMCA 18498 and un-numbered, Royal 
Museum for Central Africa, Belgium) have hypsodonty indices of 1.73 and 2.07, 
respectively (data courtesy of W. Wendelen), and a third (Powell-Cotton Museum 
42) has a hypsodonty index of ~2.19. All of these fall within the L. africana range. 

Suggestions (without data) that L. cyclotis is lower-crowned than L. africana 
africana™ are probably due either to observation of molars partly buried in skulls, 
or to consideration of crown height without reference to other dimensions of the 
tooth. 

In the fossil record, two distinct loxodontine lineages are recognized: on the 
one hand L. adaurora Maglio, 1970, and on the other L. cookei Sanders, 2007, 
L. exoptata (Dietrich, 1941) and L. atlantica (Pomel, 1879), the latter group con- 
sidered more closely related to the living species***. In Fig. 2d the earliest points, at 
6.0 Myr ago, are L. cookei; and of the later points, L. exoptata clusters with higher 
hypsodonty index values than L. adaurora (see Fig. 4). 

Lamellar counts similar to modern appear in L. exoptata from the Upper Laetoli 
Beds at ~3.85-3.60 Myr ago**”® (Supplementary Table 3, Fig. 2c), but M3 hypso- 
donty index in the range 100-150, lower than modern, persists until at least 2 Myr 
ago when the East African record ends (Fig. 2d). Therefore, unless we are missing 
earlier, more advanced populations, the completion of hypsodonty increase to 
modern L. africana and L. cyclotis values occurred after 2 Myr ago. The deduction 
that the progenitors of L. cyclotis must have entered their present niche after this 
date is based on the assumption that the degree of dental advancement seen in 
L. cyclotis (equivalent to L. africana) would not have evolved in a browser living in 
canopy forest. However, the paucity of fossils from the equatorial forest zone 
makes direct testing of this hypothesis difficult. 

There are no later fossils directly relevant to the origin and divergence of the two 
living forms. L. atlantica from the Middle to Late Pleistocene of North and south- 
ern Africa shows large body size and somewhat higher lamellar count (13-14) than 
the living species, but similar hypsodonty index (1.7-2.1, Fig. 4, Supplementary 
Table 3). However, details of its enamel loop structure appear to preclude 
ancestry to living savannah elephants”®. 
interfertile species in Old and New Worlds 
Oil palm is the most productive oil-bearing crop. Although it is 
planted on only 5% of the total world vegetable oil acreage, palm oil 
accounts for 33% of vegetable oil and 45% of edible oil worldwide, 
but increased cultivation competes with dwindling rainforest reserves. 
We report the 1.8-gigabase (Gb) genome sequence of the African oil 
palm Elaeis guineensis, the predominant source of worldwide oil 
production. A total of 1.535 Gb of assembled sequence and trans- 
criptome data from 30 tissue types were used to predict at least 
34,802 genes, including oil biosynthesis genes and homologues of 
WRINKLED1 (WRI1), and other transcriptional regulators’, which 
are highly expressed in the kernel. We also report the draft sequence of 
the South American oil palm Elaeis oleifera, which has the same 
number of chromosomes (2n = 32) and produces fertile interspecific 
hybrids with E. guineensis’ but seems to have diverged in the New 
World. Segmental duplications of chromosome arms define the 
palaeotetraploid origin of palm trees. The oil palm sequence enables 
the discovery of genes for important traits as well as somaclonal 
epigenetic alterations that restrict the use of clones in commercial 
plantings’, and should therefore help to achieve sustainability for 
biofuels and edible oils, reducing the rainforest footprint of this 
tropical plantation crop. 

The genus Elaeis (tribe Cocoseae) is in the family Arecaceae*, one of 
the oldest families of flowering plants, with fossils dating from the 
Cretaceous period®. The genus consists of two species, E. guineensis 
from West Africa® and E. oleifera from Central and South America. 
E. guineensis has a higher yield, but E. oleifera has higher unsaturated 
fatty acid content, lower height, and resistance to disease’. Commercial 
cultivation of oil palm commenced on the West African coast in the 
early twentieth century*. In southeast Asia, where it is one of the most 
important commercial crops, the first recorded oil palm was brought 
from Africa through Mauritius and Amsterdam in 1848 (ref. 9), when 
four seedlings were planted as ornamentals in the Bogor Botanical 
Gardens in Java. Commercial cultivation began in the early twentieth 
century and despite the long breeding cycle (10 to 12 years) and large 
land requirement for field trials'®, high yield breeding materials (up 
to 12 tonnes per hectare per year (t ha⁻¹ yr⁻¹) (ref. 9)) have been 
developed in less than 100 years. As such, the largely undomesticated 
oil palm is an ideal candidate for genomic-based tools including 
expressed sequence tags (ESTs)'’"'* and transcriptome sequencing of 
the oil palm fruit during development, maturation and ripening’”* to 
harness the potential of this remarkably productive crop. 

We sequenced the approximately 1.8-Gb E. guineensis genome 
(AVROS (Algemene Vereniging van Rubberplanter ter Oostkust van 
Sumatra) pisifera fruit form) to high coverage with a combination of 
Roche/454 GS FLX Titanium (Roche/454) and Sanger bacterial artificial 
chromosome (BAC) end sequencing (Supplementary Figs 1-3, Supplementary 
Tables 1-5 and Methods). The combined total length of 
the assembly (a fifth genome build or P5-build) is 1.535 Gb. Com- 
parison of the P5-build to genetic linkage maps (Supplementary Fig. 4 
and Methods) resulted in 16 genetic scaffolds (one per chromosome; 
43% of the P5 scaffold assembly) for a final total of 40,072 scaffolds 
with an N50 (the scaffold size above which 50% of the total length of the 
sequence assembly can be found) of 1.27 megabases (Mb) (Supplemen- 
tary Table 4) and 74% of RefSeq supported genes. For comparison, the 
genome of E. oleifera was also sequenced with a combination of frag- 
ment and linker libraries (Supplementary Table 2 and Methods). 
Thirty transcriptome libraries were sequenced and assembled, pro- 
ducing 4,528-18,936 isotigs (unique transcript assemblies) per library 
(Supplementary Fig. 5 and Supplementary Table 6). We sequenced 
298,039 reads from methylation-filtered genomic libraries from Deli 
dura and pisifera genotypes of E. guineensis and E. oleifera (Methods). 
Methylation-filtered libraries included 90% of the gene models and 
were enriched 5.6 times for genes with a ‘gene space’ of between 
300 and 400 Mb, comparable to that of rice and maize’”. 
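
The N50 statistic quoted above can be computed directly from a list of scaffold lengths. The sketch below follows the definition given in the text (the scaffold size above which 50% of the total assembly length is found); the function name and the toy lengths are hypothetical and do not represent the P5-build.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Scaffolds are sorted from longest to shortest; N50 is the length of the
    scaffold at which the running total first reaches half the assembly size.
    """
    if not lengths:
        raise ValueError("at least one scaffold length is required")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy assembly (arbitrary units): total 32, so half is reached by the two scaffolds of 8.
toy_n50 = n50([2, 2, 2, 3, 3, 4, 8, 8])
```

A large N50 relative to the scaffold count, as reported for the P5-build, indicates that most of the sequence sits in a small number of long scaffolds.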

The guanine-cytosine content of the E. guineensis genome (37%) is 
similar to that of other plant genomes, including the date palm", but 
genes were conspicuous for having a much higher guanine-cytosine 
content (50%). Gene-finding algorithms (Methods) predicted 158,946 
gene candidates covering 92 Mb of exonic gene space (5% of the 1.8-Gb 
genome sequence) (Supplementary Tables 4 and 5). Of these candi- 
dates, 34,802 were similar to known proteins at the peptide sequence 
level with 96% observed in transcriptome data (Methods). Of the 
remaining 124,144 candidates, 15,311 were identified in transcriptome 
data (Supplementary Table 4). Known retroelements (67,169) and 
other transposons (41,664) made up the remaining 108,833 candidates. 
Comparison to all repetitive element classes resulted in the identifica- 
tion of 775,703 independent genomic regions matching repetitive sequence 
elements, corresponding to 282 Mb of sequence (or 18% of the P5- 
build), with 39% guanine-cytosine content (Supplementary Table 4). 
Repeat content of the unmapped and unassembled contigs was far 
higher, as expected, and estimated to be approximately half of the 
1.53-Gb P5-build, or 57% of the 1.8-Gb E. guineensis genome. 
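
The counts in this paragraph are internally consistent, which a short tally makes explicit. All numbers below are copied from the text; only the variable names are introduced here for illustration.

```python
# Gene-candidate counts reported in the text.
total_candidates = 158_946
similar_to_known_proteins = 34_802
found_in_transcriptome_only = 15_311
retroelements = 67_169
other_transposons = 41_664

# Candidates left after removing those similar to known proteins.
remaining = total_candidates - similar_to_known_proteins
# Transposon-derived candidates.
transposon_candidates = retroelements + other_transposons

# The remaining candidates split into transcriptome-supported and transposon-derived.
assert remaining == 124_144
assert remaining - found_in_transcriptome_only == transposon_candidates

# Fractions quoted in the text (sizes in Mb).
repeat_fraction_of_build = 282 / 1535      # about 0.18 of the P5-build
exonic_fraction_of_genome = 92 / 1800      # about 0.05 of the 1.8-Gb genome
```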

The 16 EG5 chromosomes (Fig. 1) were numbered according to size 
and compared with previous mapping (Supplementary Table 7) and 
fluorescence in situ hybridization studies (FISH)'’. Gene density (Fig. 1a) 
was distributed unevenly: five of the smallest six chromosomes had one 
gene-rich arm, and one repeat-rich arm, as shown previously with 
FISH”. Known repeat classes matching the TIGR grass repeat database 
and REPBASE were distributed in gene-poor, methylated regions 
(Fig. 1b, c), whereas simple di- and trinucleotide repeats'” were mostly 
within genic regions (Fig. 1d). Potential centromeric regions were identified 
using an internally repetitive pericentromeric repeat’’ (Fig. 1e, 
purple), whereas highly conserved TTTAGGG telomeric repeat arrays 
were identified at the extreme ends of 7 of the 32 chromosome arms 
(Fig. 1e, green). The most prominent 5S ribosomal RNA cluster (Fig. 1e, 
orange) was found on the largest chromosome, whereas the only telo- 
centric chromosome was one of the two smallest, as described previously”. 
Two interstitial telomere repeat arrays on chromosomes 2 and 14 were 
embedded within putative centromeric regions. Robertsonian fusions 
of telocentric chromosome ends may have given rise to these two 
chromosomes’, and date palm has 18 chromosomes", consistent with 
this hypothesis. Typical of monocot genomes”’, the most abundant 
repetitive elements were copia (33%) and gypsy (8%) retroelements, 
as well as other long terminal repeat (LTR) retrotransposons (6%) 
(Supplementary Fig. 6 and Supplementary Methods). Interestingly, 
47% of all repeats observed were uncharacterized previously, with 
73% absent from E. oleifera and 99% absent from Musa acuminata 
(banana). The distribution of repeats in methylation-filtered reads 
indicated that RIRE1 and other copia elements are especially heavily 
methylated. 
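
Tandem arrays of the TTTAGGG telomeric repeat, whether terminal or interstitial, can be located with a simple pattern search. The following is only a sketch of the idea and not the method used in the paper: the function name and the `min_copies` threshold are assumptions, and real analyses would also tolerate imperfect repeat units.

```python
import re

TELOMERE_UNIT = "TTTAGGG"
TELOMERE_UNIT_RC = "CCCTAAA"  # reverse complement of TTTAGGG


def find_telomere_arrays(sequence: str, min_copies: int = 4):
    """Return (start, end) spans of perfect tandem telomere repeat arrays.

    An array is min_copies or more consecutive copies of TTTAGGG, or of its
    reverse complement, so arrays on either strand are reported.
    """
    pattern = re.compile(
        rf"(?:{TELOMERE_UNIT}){{{min_copies},}}|(?:{TELOMERE_UNIT_RC}){{{min_copies},}}"
    )
    return [(m.start(), m.end()) for m in pattern.finditer(sequence.upper())]


# Toy sequence: five repeat units flanked by unrelated bases.
spans = find_telomere_arrays("ACGT" + "TTTAGGG" * 5 + "ACGT")
```

Comparing the reported spans with scaffold ends would distinguish terminal arrays from interstitial ones such as those described on chromosomes 2 and 14.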

Comparison of E. guineensis chromosomes to each other revealed 
that oriented homologous duplicated sequences (segmental duplica- 
tions) are notably abundant (Fig. 1 and Supplementary Fig. 7). Analysis 
of conserved gene order revealed that the duplications were retained in 
E. oleifera, so that segmental duplications pre-dated the divergence of 
the African and South American oil palm (Supplementary Fig. 8a). 
Given that most of the genome is represented by segmental duplica- 
tions, and not triplications, we conclude that oil palm is a palaeote- 
traploid, in line with speculation based on cytogenetics and restriction 
fragment length polymorphism (RFLP) mapping’’””°. These duplica- 
tions do not span the putative pericentromeric regions (Fig. 1), indi- 
cating that most centromeres arose after polyploidization, consistent 
with extensive chromosome restructuring. Homologues of 94.4%, 
83.5% and 80.2% of the genes from Phoenix dactylifera, M. acuminata 
and Arabidopsis respectively, were found in E. guineensis (Fig. 2a, 
Supplementary Table 8 and Methods). Each E. guineensis duplication 
matched unique scaffolds in the date palm genome (Supplementary 
Fig. 8b), indicating that date palms have most of the segmental duplications 
found in oil palm. Polyploidization has been inferred by chromosome 
counts in only a limited number of Arecaceae”', and a likely 
scenario is that the progenitor of both palms arose as a polyploid. We 
performed a similar analysis of the banana genome” and found extensive 
synteny between each oil palm chromosome and several chromosomes 
from banana (Supplementary Fig. 9), confirming that duplication 
events in the M. acuminata genome occurred after the Musa-Elaeis 
split, as proposed previously”. 

The 34,802 sequence similarity gene predictions (Supplementary 
Table 4) were annotated for gene ontology terms (Fig. 2b, c), with a 
focus on oil biosynthesis (Methods). Oil synthesis in the kernel com- 
mences at 11 to 12 weeks after anthesis (WAA) and is complete by 
15 to 16 WAA at which stage mesocarp oil synthesis starts, reaching a 
peak at 20 WAA”. In plants, de novo fatty acid synthesis (FAS) is 
compartmentalized in plastids, whereas triacylglycerol (TAG) syn- 
thesis occurs in the cytoplasm. Although the oil palm accumulates 
markedly higher TAG than the date palm’, the number of genes 
involved in TAG biosynthesis is notably similar in both palms. In 
contrast, FAS genes have higher representation in the oil palm genome 
(Fig. 2b). Apart from the transcriptomes of 30 tissues (Fig. 3a, b, 
Supplementary Fig. 5 and Supplementary Table 6), in-depth sequen- 
cing of kernel and mesocarp (Fig. 3c and Supplementary Table 9) 
indicated that the transcriptome signatures are similar in both tissues 
with plastidial FAS genes upregulated compared to TAG. Thus, the 
enzymes of the Kennedy pathway for TAG assembly must cope 
with the high flux of de novo fatty acids in oil palm. FAS genes were 


Figure 1 | The chromosomes of oil palm. E. guineensis has 16 chromosome
pairs, ordered by size, which correspond to 16 linkage groups identified by
genetic mapping (Supplementary Table 7). Tracks displayed are: a, gene
density; b, methyl-filtered read density; c, retroelement density; d, simple
sequence repeats; e, low copy number repetitive elements, including
telomere repeat TTTAGGG (green), 5S rRNA (orange) and pericentromeric
repeats (purple); f, regional G-C content (range 0.3-0.45); g, genetically
mapped scaffolds from the P5-build; and h, segmental duplications.
Densities for telomere repeats are exaggerated for visual clarity.

Figure 2 | Gene model comparisons. a, Venn diagram illustrating the
proportion of shared gene family clusters in E. guineensis, M. acuminata,
A. thaliana and P. dactylifera. Number of genes (clusters) compared were:
A. thaliana, 27,416 (15,399); M. acuminata, 36,529 (16,874); P. dactylifera,
28,882 (15,979) and E. guineensis, 34,802 (16,374). b, Gene ontology
classifications of oil palm and date palm. c, Ratio of gene number (oil palm:date
palm) in each gene ontology classification. Deg, degradation; FA, fatty acid; 
hexoses-P, hexose phosphate pathway; mal and pyr met, malate and pyruvate 
metabolism; mt, mitochondrial; OPP, oxidative pentose phosphate; pt PLP syn, 
plastidial phospholipid synthesis; TAG, triacylglycerol; TPs, transporters. 


Figure 3 | Lipid and carbohydrate 
metabolism in oil palm fruits. 

a, b, Number of lipid-synthesis- 
related (a) and carbohydrate- 
synthesis-related (b) gene transcripts 
in different tissues. c, Comparison of 
gene-expression levels between 
kernel and mesocarp tissue before 
and at peak of oil accumulation. 
Figure 4 | Phylogenetic analysis. A carefully annotated subset of proteins
from E. guineensis, E. oleifera and P. dactylifera were included in a matrix of 
1,685 gene partitions (858,954 patterns) and 107 taxa. This matrix is extracted 
from partitions with at least 30 taxa present in a much larger matrix. 


upregulated markedly just before the onset, and then declined during 
the peak of lipid accumulation. This contrasts with previous reports that
FAS transcripts continue to increase in the mesocarp’. This may reflect 
the tenera fruit form used in this study, as dura has a longer maturation 
period (22 WAA)*™ than tenera (20 WAA)**”°. WRI1 regulates oil accumulation
in the oil palm mesocarp' and we found highest mesocarp
expression of WRI1 just before lipid accumulation onset. However, 
kernel showed 75-fold higher WRI1 expression compared to mesocarp 
(Fig. 3c and Supplementary Table 9), implying its pivotal role in kernel 
oil synthesis. LEAFY COTYLEDON1 (LEC1), LEC2, ABSCISIC ACID
INSENSITIVE3 (ABI3) and FUSCA3, which activate WRI1 in oilseeds,
were not found in mesocarp transcriptomes, but were well-represented 
in the kernel, with LEC1 and ABI3 showing higher expression at the 
start and completion of oil accumulation, respectively. Interestingly, the 
transcriptional regulator PICKLE (PKL) was expressed in both the 
mesocarp and kernel (Supplementary Table 9). 

Genes involved in sucrose degradation and the oxidative pentose 
phosphate pathway were more highly represented in oil palm than 
date palm (Fig. 2b). Pentose phosphates are recycled into glucose 
6-phosphate to fuel glycolytic pathways, and import of these cytosolic 
metabolites requires specific transporters on the plastid envelope. 
Although these transporter genes are upregulated strongly in oil palm’, 
they are similarly represented in date and oil palm genomes (Fig. 2b). 
Thus, channelling of sugars destined for oil synthesis is regulated at the 
transcriptome level in oil palm. Additional insights important to TAG 
biosynthesis, fruit ripening and abscission are provided in Sup- 
plementary Notes. 

To place palms on the evolutionary tree, evidence-based gene models 
from each species were combined with a previous seed plant data 
set*® to form a matrix of 1,685 gene partitions (858,954 patterns) and 
107 taxa. P. dactylifera, E. guineensis and E. oleifera are present in 1,206, 
1,229 and 1,190 partitions, respectively. All three were well separated 
from other monocots (Fig. 4), including nearest neighbours Musa (banana), 
Curcuma (turmeric) and Zingiber (ginger). Phylogenetic dating using 
conservative constraints (Methods) predicted a divergence 65 million 


A maximum likelihood tree of monocotyledonous taxa is shown, along with 
bootstrap values when less than 100 (Methods). Scale bar indicates the mean 
number of substitutions per site. 


years (Myr) ago between date and oil palm, and 51 Myr between E. oleifera 
and E. guineensis. This is comparable with divergence between Old and 
New world relatives such as African Sorghum bicolor (sorghum) and 
American Zea mays (maize) panicoid grasses (26 Myr). Unlike maize 
and sorghum, however, E. guineensis and E. oleifera give rise to fertile 
hybrids’, consistent with the vicariant hypothesis for phylogeographi- 
cal divergence, in which geographically isolated species are under no 
selective pressure to evolve reproductive isolation”. 

The genome sequence of oil palm will be a rich resource for oil palm 
breeders, geneticists and evolutionary biologists alike. It has revealed 
that palms are ancient tetraploids, and that the African and South 
American species probably diverged in the Old and New Worlds. Over- 
represented genes in lipid and carbohydrate metabolism are expressed 
differentially in mesocarp and kernel, accounting for the different 
properties of palm fruit and palm kernel oils. The genome sequence 
will also enable mapping of somaclonal epigenetic alterations that 
restrict the use of clones in commercial plantings. The dense repres- 
entation of sequenced scaffolds on the genetic map will facilitate iden- 
tification of genes responsible for important yield and quality traits. 
The genome sequence of this tropical plantation crop is an important 
step in achieving the goals that are critical to the sustainability chal- 
lenges associated with growing demands for biofuels and edible oils. 


METHODS SUMMARY 

Genome sequencing and assembly, genetic mapping and gene annotation. 
Sequencing reads (Roche/454) were generated from genomic and BAC pool 
DNA fragment and paired-end linker libraries (Supplementary Figs 1-3 and 
Supplementary Tables 1-3). BAC end sequencing was performed using the 
Sanger method. Sequence data were assembled as described (Methods). The gen- 
etic map for the selfed Nigerian tenera palm T128 was constructed as described 
(Methods). Gene predictions, genome and protein comparisons and gene onto- 
logy annotation were performed as described (Methods). 

Transcriptome sequencing. Thirty transcriptome libraries were constructed and 
transcriptome sequences were generated (Roche/454) (Methods). Transcriptome 
libraries from kernel and mesocarp were constructed and deep sequenced by 
Illumina HiSeq 2000 (Methods). 
Methylation filtration. Methylation filtration libraries were constructed as
described’ and resulting clones were sequenced using the Sanger method (Methods). 
Phylogenetic analysis. Open reading frames were assembled from evidence- 
based gene models and combined with a large seed plant data set*® (Methods). 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Genome assembly. The assembly of E. guineensis (AVROS, pisifera fruit form) 
genome P5-build was constructed from sequences from a total of 148 linker lib- 
raries and 81 fragment libraries (Roche/454). Reads were generated from genomic 
DNA fragment libraries (53.5 million reads), BAC pool DNA fragment libraries 
(3.6 million reads), and from a series of genomic (89.1 million reads) and BAC 
(8.6 million reads) paired-end linker libraries. In total, 46.8 billion bases of raw 
sequence were generated, representing approximately 26-fold raw sequence cov- 
erage of the 1.8-Gb oil palm genome. Sequence data were assembled using the 
Newbler algorithm* (Supplementary Table 4). The seventh assembly and genome 
build of E. oleifera (O7-build) was constructed from a total of 127 linker libraries 
and 68 fragment libraries. In total, 130 million E. oleifera reads, representing 
approximately 25-fold raw sequence coverage, were generated (Supplementary 
Table 2). 
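
The coverage figure quoted above follows from dividing the total raw bases by the genome size. The short Python sketch below only reproduces that arithmetic; the function name `fold_coverage` is hypothetical and is not part of the assembly pipeline described here.

```python
def fold_coverage(total_bases: float, genome_size_bp: float) -> float:
    """Return raw fold coverage: sequenced bases divided by genome size."""
    if genome_size_bp <= 0:
        raise ValueError("genome size must be positive")
    return total_bases / genome_size_bp


# Values quoted in the text: 46.8 billion raw bases and a 1.8-Gb genome.
coverage = fold_coverage(46.8e9, 1.8e9)
print(f"approximately {coverage:.0f}-fold raw coverage")
```
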

High information content restriction fragment fingerprints were generated for 
124,286 BAC clones from the E. guineensis genome with an average size of 150 kb. 
BAC fingerprints were used to construct a physical map of the reference genome 
(Supplementary Figs 2 and 3). In addition, BAC ends from this library were 
sequenced using Sanger sequencing to generate 235,613 paired reads with an 
average read length of approximately 600 bp. BAC end reads were used to create 
the shotgun assembly, as well as to locate individual BACs within the assembled 
genome. 

Before assembly, all Roche/454 sequence runs were screened for quality based 
on average read length, linker library efficiency and library redundancy using a 
custom pipeline based on the SSAHA program”. For linker positive reads, library 
insert sizes were estimated by aligning both ends to a draft assembly of oil palm, 
and measuring the intervening sequence in cases in which both ends matched a 
single draft scaffold. In order to minimize the negative impact of chimaeras in the 
linker libraries, we removed all identical reads that were due to library redundancy, 
not independent observations. In addition to the Roche/454 data, a set of 235,613 BAC 
end reads from the Origen_1 BAC library were included in the P5 assembly, with 
an average BAC size of 150kb. These data were assembled using the Newbler 
algorithm” on a Dell PowerEdge R910 server with 512 Gb of RAM, and 32 cores 
running Ubuntu Linux 10.04. The P5 assembly took 15 days and 5h to complete. 
Genetic map construction. Two genetic maps were constructed. The first map- 
ping population was derived from the self-pollination of the Nigerian tenera palm 
T128. The mapping population and map-construction methodology are described 
elsewhere*®. The second mapping population comprised 87 palms obtained from a 
cross between Ulu Remis Deli dura (ENL48) and Yangambi pisifera (ML161) grown 
at the FELDA Research Station at Jerantut, Malaysia. Linkage analysis for the 
dura × pisifera (P2) cross was performed using both JoinMap 4.0 and GenStat
14th edition. JoinMap was used to examine markers and identify loci showing 
distorted segregation (chi-squared test). JoinMap was initially used to construct 
the two parental framework maps at recombination frequency = 0.2 and a nearest
neighbour stress value of = 4 (centimorgan) using the maximum likelihood map- 
ping algorithm as described’!. The density of the linkage maps was later increased 
by mapping additional co-dominant markers into the parental framework maps 
using the maximum likelihood mapping algorithm in GenStat 14. The integrated 
map was built using the maximum likelihood mapping method in GenStat 14 by
combining data from markers on both parental maps. Comparison
between the integrated map and the parental maps was visualized using 
MapChart 2.2 (ref. 32). 
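
The chi-squared screen for distorted segregation mentioned above can be illustrated with a minimal Python sketch. This is not JoinMap's implementation: the genotype counts and the 1:2:1 expectation are hypothetical, and the critical values are the standard 5% thresholds for 1 to 3 degrees of freedom.

```python
# Standard chi-squared critical values at alpha = 0.05, keyed by degrees of freedom.
CRITICAL_0_05 = {1: 3.841, 2: 5.991, 3: 7.815}


def chi_squared(observed, expected_ratio):
    """Goodness-of-fit statistic for genotype counts against a Mendelian ratio."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    stat = 0.0
    for obs, ratio in zip(observed, expected_ratio):
        expected = total * ratio / ratio_sum
        stat += (obs - expected) ** 2 / expected
    return stat


def is_distorted(observed, expected_ratio):
    """True when the marker departs from the expected ratio at the 5% level."""
    df = len(observed) - 1
    return chi_squared(observed, expected_ratio) > CRITICAL_0_05[df]


# Hypothetical counts for two co-dominant markers scored in 87 palms.
print(is_distorted([22, 45, 20], (1, 2, 1)))  # close to 1:2:1
print(is_distorted([40, 40, 7], (1, 2, 1)))   # strongly skewed
```

Markers flagged in this way would be candidates for exclusion or separate inspection before framework maps are built.
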

Genetic-map integration and chromosome-sequence construction. The 
1,511 markers used in the generation of the T128_codominant and P2_DxP maps 
were compared to scaffolds from the P5-build using the exonerate*’ program with 
an ungapped alignment and a minimum identity match of 97%. Markers that did 
not uniquely map to P5 scaffolds were discarded, and one scaffold marker ordering 
was created for each of the two maps. After reviewing shared scaffolds between the 
two maps, a final order was determined for ordering 169 scaffolds, and ordering 
and orienting 124 scaffolds based on multiple markers. The scaffold sequences 
were then concatenated in order and reverse complemented as required to create 
16 linkage-group-based chromosome sequences. During map integration, LG15
(T128_codominant naming convention) appeared significantly shorter than a 
previous integration based on the P4 assembly with the P2_regression map. This 
was owing to map instability introduced into the P2_DxP map generation in which 
the P2_regression was more stable. After review, the LG15 chromosome sequence 
was extended based on mapping of P5 scaffolds onto the P4/P2_regression version. 
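
The concatenation step described above (ordering scaffolds and reverse complementing them as required) can be sketched as follows. This is a minimal Python illustration, not the authors' code: the scaffold names, the `layout` structure and the run of N used as a spacer between scaffolds are all assumptions, since the text does not state how scaffold junctions were represented.

```python
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def build_chromosome(scaffolds, layout, gap="N" * 100):
    """Join scaffolds in map order, flipping those marked '-'.

    scaffolds: dict of scaffold name -> sequence
    layout: list of (scaffold name, orientation) in genetic-map order
    gap: spacer placed between scaffolds (an assumption of this sketch)
    """
    parts = []
    for name, orientation in layout:
        seq = scaffolds[name]
        if orientation == "-":
            seq = reverse_complement(seq)
        parts.append(seq)
    return gap.join(parts)


# Hypothetical two-scaffold linkage group.
toy = {"sc1": "AACG", "sc2": "TTGC"}
print(build_chromosome(toy, [("sc1", "+"), ("sc2", "-")], gap="NN"))
```
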
Gene identification and annotation. Based on the longest set of scaffolds repre- 
senting 10% of the original P1-build, we used the SNAP* gene finder to identify 
initial candidate gene predictions for the E. guineensis genome. Initial SNAP runs 
were performed using the rice (Os.hmm) gene model. Initial genes discovered 
were compared with the RefSeq35 database as well as the TIGR Gramineae repeat 
database in order to remove retroelements and pseudogenes. The remaining 


transcripts were then screened for missing start and stop codons, as well as other 
warnings from SNAP. The remaining set was then used as input to the SNAP 
programs FATHOM and FORGE according to the SNAP documentation to create 
a new hidden Markov model (HMM) with greater specificity to E. guineensis. The 
same screening process was applied again and four iterations of the training were 
used resulting in the final ‘pisif_2_22_11.hmm’ gene model. 

For the Glimmer* predictions, assembled E. guineensis transcriptome sequences 
(groups A, C, D and G in Supplementary Table 6) were translated from start to stop 
with a size selection ranging from 500 to 5,000 nucleotides. These were then com- 
pared to complete coding sequences for Magnoliophyta from GenBank using 
BLASTX (E-value < 10⁻¹⁰). Transcripts with homology starting at position one
of the Magnoliophyta targets were selected for further analysis. The transcripts 
were then screened using BLASTClust (NCBI) and CD-HIT* to reduce the number 
of genes to meet Glimmer’s training requirements. Exon boundaries were determined 
by mapping to the previous P4-build, and were used to create an E. guineensis 
HMM. 
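
As an illustration of the "start to stop" translation window and size selection described above, the Python sketch below extracts the longest ATG-to-stop open reading frame on the forward strand and keeps it only if it falls in the 500 to 5,000 nucleotide range. It is a simplified assumption of how such a filter could work: the function names are hypothetical, only the three forward frames are scanned, and the size limits are applied to the open reading frame rather than to the whole transcript.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str) -> str:
    """Longest forward-strand ORF running from ATG to the first in-frame stop."""
    seq = seq.upper()
    best = ""
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]
                if len(orf) > len(best):
                    best = orf
                start = None
    return best


def size_selected(seq: str, min_len: int = 500, max_len: int = 5000):
    """Return the longest ORF if its length is within the limits, else None."""
    orf = longest_orf(seq)
    return orf if min_len <= len(orf) <= max_len else None


# Toy transcript with a 15-nucleotide ORF, too short for the default window.
toy_transcript = "CC" + "ATG" + "GCT" * 3 + "TAA" + "GG"
print(longest_orf(toy_transcript))
print(size_selected(toy_transcript))
```
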

Transcriptome analysis. Reads (Roche/454) from each library and from all libraries 
(Fig. 3a, b and Supplementary Table 6) were assembled into isotigs respectively. 
The isotigs from each library were blasted on A. thaliana gene models with a 
threshold of E-value <10~°. The best hit A. thaliana gene model was assigned 
to the homologue of the query isotig. Based on Bourgis’ annotation’, copy numbers 
were given to each gene in each category group. The final copy numbers of each 
functional group were scaled on the number of genes in each group. To estimate 
expression level of genes in mesocarp and kernel tissues, Illumina HiSeq 2000 reads 
from each library were mapped to assembled isotigs from all E. guineensis reads by 
using the Burrows-Wheeler Aligner. Gene group expression levels were calculated 
as the number of mapped reads on each isotig divided by the total number of 
isotigs, multiplied by 100,000, and scaled by the number of genes in each gene 
group. Both copy number and read coverage were the mean of measures from two 
biological replicates. Data were analysed as described above for Roche/454 data, 
except that expression levels were calculated as transcripts per million tags. 
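
The read-count normalization described above can be read in more than one way. The Python sketch below shows one possible interpretation, stated as an assumption: each isotig's mapped-read count is divided by the total number of isotigs and multiplied by 100,000, and a gene group's level is the sum of its isotig values divided by the number of genes in the group. All names and numbers are hypothetical.

```python
def isotig_level(mapped_reads: int, total_isotigs: int, scale: int = 100_000) -> float:
    """Mapped reads on one isotig, normalized by the total number of isotigs."""
    return mapped_reads / total_isotigs * scale


def group_level(reads_by_isotig, group_isotigs, total_isotigs, genes_in_group):
    """Summed isotig levels for a gene group, scaled by the group's gene count."""
    summed = sum(
        isotig_level(reads_by_isotig[name], total_isotigs) for name in group_isotigs
    )
    return summed / genes_in_group


# Hypothetical counts: three isotigs out of 1,000, two of them in one gene group.
reads = {"isotig_a": 50, "isotig_b": 150, "isotig_c": 10}
print(group_level(reads, ["isotig_a", "isotig_b"], total_isotigs=1000, genes_in_group=2))
```

In practice the value for each tissue would then be averaged over the two biological replicates mentioned in the text.
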
Methylation-filtered library analysis. Methylation-filtered (‘GeneThresher’) 
genomic DNA libraries were constructed as described’*”” to select unmethylated 
clones (depleted of most repetitive sequences*’) by propagation in McrBC+ strains
of Escherichia coli. Briefly, nuclear DNA was extracted individually from young 
leaves of Deli dura and AVROS pisifera of E. guineensis and from E. oleifera. For 
each of the three DNA populations, genomic shotgun libraries were constructed as 
described’. Sequences were generated from one end of each cloned insert by ABI 
3730 sequencing (Life Technologies), generating 298,039 reads (73,390 from Deli 
dura, 101,327 from AVROS pisifera and 123,322 from E. oleifera). 

Segmental duplication analysis. Chromosomes from the EG5-linked assembly 
were screened in a self—self comparison test using the MUMmer3 set of tools”. 
Final alignments were carried out on chromosome pairs using the PROmer program
(optional parameters -d 0.5 -c 200); alignments were reviewed visually with
the MUMmer plot program, and approximate boundaries for segmental duplica- 
tions were recorded (Supplementary Fig. 7 and Supplementary Table 7). 

To test for the existence of observed segmental duplications in other genomes, 
we performed comparative genomics of each half of the 16 segmental duplications 
in E. guineensis with the E. oleifera and P. dactylifera scaffold sets. Comparisons 
were performed using a custom analysis pipeline based on the MUMmer program 
(optional parameters -n -l 30 -b -c -L). MUMmer output was summarized as an
offset-sorted overlap plot showing where query scaffolds share local alignment 
with a reference chromosome. Output was reviewed to verify that each half of the 
proposed segmental duplication was present in the query genome, and that the 
scaffolds matching were different for each half of the duplication in E. guineensis. 
Genome comparison by gene models. NCBI TBLASTN program was used to 
compare A. thaliana, O. sativa, P. dactylifera and E. guineensis predicted proteins 
on the genomes of each of the four species, with a threshold of E-value < 10>. At 
this level of conservation, matches represent shared gene families between the 
query and target genomes. By comparing predicted proteins to genome sequence, 
biases introduced by gene prediction methods are minimized compared to a direct 
gene-level comparison. Comparisons between the gene models from one species 
and its own genome are close to, but less than 100%, owing to the limit of sens- 
itivity of TBLASTN at this E-value cutoff. Public database sources for comparative 
genome sequences and gene models are provided in Supplementary Methods. 
Gene clustering and Venn diagram. CD-HIT clustering algorithm”* was used to 
look for homologous protein sequences among A. thaliana, O. sativa, P. dactylifera 
and E. guineensis at 40% similarity level. This algorithm avoids all-versus-all BLAST 
search by using a short word filter. 

Gene ontology annotation. E. guineensis gene model protein sequences were 
queried against A. thaliana annotated gene model protein sequences by using 
BLASTP“ with a threshold of E-value< 10°. The blast output file was then 
loaded into Blast2GO"'. Blast2GO performed gene ontology annotation by using 
an annotation rule on the found ontology terms. The most specific annotations
were assigned on each sequence with default parameters in the annotation rule. 
Phylogenetic analysis. A carefully annotated set of open reading frames was 
assembled from evidence-based gene models in each species. The oil palm open 
reading frames were combined with a large seed plant data set published prev- 
iously*®, with updated gene models for Arabidopsis thaliana, Oryza sativa and 
Solanum lycopersicum, as well as the addition of P. dactylifera, M. acuminata, 
Carica papaya and Selaginella moellendorffii (outgroup) for a total of 107 taxa.
Orthologues were identified using OrthologID”, and a sub-matrix with at least 
30 taxa present per partition was extracted for phylogenetic analysis. This sub-matrix 
contains 1,685 gene partitions with 858,954 patterns. Maximum likelihood analysis 
was carried out using RAxML* with the JTT+F+I model and 100 bootstrap 
replicates. 
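
The sub-matrix extraction described above, which keeps only partitions with at least 30 taxa present, amounts to a simple occupancy filter. The Python sketch below illustrates the idea on toy data; the dictionary layout and the function name are assumptions and do not reflect OrthologID's actual output format.

```python
def select_partitions(partitions, min_taxa=30):
    """Keep gene partitions in which at least `min_taxa` distinct taxa are present.

    partitions: dict of partition name -> iterable of taxon names with sequence
    """
    return {
        name: sorted(set(taxa))
        for name, taxa in partitions.items()
        if len(set(taxa)) >= min_taxa
    }


# Toy example with a threshold of 3 taxa instead of 30.
toy_partitions = {
    "gene_1": ["E. guineensis", "E. oleifera", "P. dactylifera", "M. acuminata"],
    "gene_2": ["E. guineensis", "O. sativa"],
}
print(list(select_partitions(toy_partitions, min_taxa=3)))
```
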


28. Margulies, M. et al. Genome sequencing in microfabricated high-density picolitre
reactors. Nature 437, 376-380 (2005).

29. Ning, Z., Cox, A. J. & Mullikin, J. C. SSAHA: a fast search method for large DNA
databases. Genome Res. 11, 1725-1729 (2001).

30. Singh, R. et al. The oil palm SHELL gene controls oil yield and encodes a homologue
of SEEDSTICK. Nature http://dx.doi.org/10.1038/nature12356 (this issue).

31. Jansen, J. Construction of linkage maps in full-sib families of diploid outbreeding
species by minimizing the number of recombinations in hidden inheritance
vectors. Genetics 170, 2013-2025 (2005).

32. Voorrips, R. E. MapChart: software for the graphical presentation of linkage maps
and QTLs. J. Hered. 93, 77-78 (2002).

33. Slater, G. S. & Birney, E. Automated generation of heuristics for biological sequence
comparison. BMC Bioinformatics 6, 31 (2005).

34. Korf, I. Gene finding in novel genomes. BMC Bioinformatics 5, 59 (2004).

35. Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two
open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878-2879
(2004).

36. Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of
protein or nucleotide sequences. Bioinformatics 22, 1658-1659 (2006).

37. Bedell, J. A. et al. Sorghum genome sequencing by methylation filtration. PLoS Biol.
3, e13 (2005).

38. Ouyang, S. & Buell, C. R. The TIGR Plant Repeat Databases: a collective resource for
the identification of repetitive sequences in plants. Nucleic Acids Res. 32,
D360-D363 (2004).

39. Kurtz, S. et al. Versatile and open software for comparing large genomes. Genome
Biol. 5, R12 (2004).

40. Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein
database search programs. Nucleic Acids Res. 25, 3389-3402 (1997).

41. Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and
analysis in functional genomics research. Bioinformatics 21, 3674-3676 (2005).

42. Chiu, J. C. et al. OrthologID: automation of genome-scale ortholog identification
within a parsimony framework. Bioinformatics 22, 699-707 (2006).

43. Stamatakis, A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses
with thousands of taxa and mixed models. Bioinformatics 22, 2688-2690 (2006).


©2013 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature12356 


The oil palm SHELL gene controls oil yield and 
encodes a homologue of SEEDSTICK 


Rajinder Singh’, Eng-Ti Leslie Low’, Leslie Cheng-Li Ooi!, Meilina Ong-Abdullah', Ngoot-Chin Ting', Jayanthi Nagappan’, 
Rajanaidu Nookiah’, Mohd Din Amiruddin’, Rozana Rosli', Mohamad Arif Abdul Manaf", Kuang-Lim Chan!, Mohd Amin Halim’, 
Norazah Azizi’, Nathan Lakey”, Steven W. Smith”, Muhammad A. Budiman’?, Michael Hogan?, Blaire Bacher”, Andrew Van Brunt’, 
Chunyan Wang’, Jared M. Ordway’, Ravigadevi Sambanthamurthi' & Robert A. Martienssen* 


A key event in the domestication and breeding of the oil palm 
Elaeis guineensis was loss of the thick coconut-like shell surround- 
ing the kernel. Modern E. guineensis has three fruit forms, dura 
(thick-shelled), pisifera (shell-less) and tenera (thin-shelled), a 
hybrid between dura and pisifera’*. The pisifera palm is usually 
female-sterile. The tenera palm yields far more oil than dura, and is 
the basis for commercial palm oil production in all of southeast 
Asia’. Here we describe the mapping and identification of the SHELL 
gene responsible for the different fruit forms. Using homozygosity 
mapping by sequencing, we found two independent mutations in 
the DNA-binding domain of a homologue of the MADS-box gene 
SEEDSTICK (STK, also known as AGAMOUS-LIKE 11), which 
controls ovule identity and seed development in Arabidopsis. The 
SHELL gene is responsible for the tenera phenotype in both culti- 
vated and wild palms from sub-Saharan Africa, and our findings 
provide a genetic explanation for the single gene hybrid vigour (or 
heterosis) attributed to SHELL, via heterodimerization. This gene 
mutation explains the single most important economic trait in oil 
palm, and has implications for the competing interests of global 
edible oil production, biofuels and rainforest conservation’. 

Oil palm fruits (drupes) are derived from three fused carpels and 
consist of epicarp, mesocarp and endocarp tissues surrounding one or 
more kernels. Hybrids (tenera) between dura and pisifera (Fig. 1) have 
a distinct fibre ring that surrounds the coconut-like shell of the oil palm 
seed’. The SHELL gene responsible for this phenotype has co-dominant 
monogenic inheritance, first reported in the Belgian Congo in the 1940s 
(ref. 7). However, tenera fruit forms were recognized and exploited in 
Africa well before then*®. Given the central role played by the SHELL 
gene, oil palm breeding uses reciprocal recurrent selection of maternal 
(dura) and paternal (pisifera) pools*. The Deli dura population, direct 
descendants of the four original African palms planted in the Bogor 
Botanical Garden, Indonesia (1848), has excellent combining ability 
with the AVROS (Algemene Vereniging van Rubberplanters ter 
Oostkust van Sumatra) pisifera and other pisifera parental palms. 
AVROS pisifera palms were derived from the famous ‘Djongo’ palm
from Congo, but more recently several different accessions of dura and 
tenera have also been sourced from Africa*. Tenera palms are thought 
to have been selected by pre-colonial cultures in West Africa owing to 
their higher oil yields, and are the basis for modern oil palm breeding’. 

The SHELL gene lies 4.7 and 9.8 cM away from the closest molecular 
markers’ "', but has proven extremely challenging to identify given the 
large genome, long generation times and difficulty of phenotyping in 
experimental populations of oil palm, which are widely distributed 
among different plantations (Methods). We used a two-tiered approach 
to identify the SHELL gene, taking advantage of the recently completed 
oil palm genome sequence”. First, 240 F1 progeny, derived by controlled
self-pollination of the Nigerian tenera accession T128 and
grown over two decades in plantations throughout Malaysia, were 


scored for fruit form phenotype’’. In addition to 200 restriction frag- 
ment length polymorphism (RFLP) and simple sequence repeat (SSR) 
markers, the progeny were genotyped for 4,451 single nucleotide 
polymorphism (SNP) markers derived from the oil palm genome 
sequence” by the Infinium iSelect Assay (Illumina). A genetic map 
with 16 linkage groups was constructed, and the SHELL gene locus was 
placed in T128 linkage group 7 (Supplementary Fig. 1), consistent with 
previous studies (Methods), and mapped by sequence similarity to a 
3.4-megabase (Mb) assembly scaffold 43 (p3-sc00043) at the end of 
chromosome 2 (ref. 12). A tiling path of bacterial artificial chromosome 
(BAC) contigs corresponding to scaffold 43 was selected from a high- 
information content physical map of pisifera and sequenced”. Further 
SNP assays were designed from an improved assembly, and additional 

Figure 1 | Fruit forms of the African oil palm Elaeis guineensis. a, Dura
(Sh/Sh) fruit forms have a thick lignified shell surrounding the kernel, which is
absent in pisifera (sh/sh). F1 hybrid palms (Sh/sh) have an intermediate fruit
form (tenera) that is much higher yielding than either parent in terms of
mesocarp oil. b, Dura, tenera and pisifera fruit stained for lignin (red) with
phloroglucinol.

genotypes incorporated into the map. In a review of single marker data,
recombinant breakpoints were identified, indicating that the gene lay 
in a 450-kilobase (kb) interval (Methods). 

Next, we used homozygosity mapping using the AVROS pedigree 
(Fig. 2a) and whole-genome resequencing (Methods). In this technique, 
candidate genes appear as regions with low diversity in homozygous 
inbred individuals’*’*. There were 14 individual pisifera palm genomes 
sequenced at 20× genomic coverage, and 29 additional pisifera palms
sequenced as a pool (pool 1) at 40× coverage (Illumina HiSeq 2000).
SNPs were called throughout the genome, and those in scaffold 43 were 
scored for homozygosity. The resulting homozygosity plot had a local 
minimum of 200 kb (centred on 350,000 base pairs (bp) in p3-sc00043; 
Fig. 2b, c). This 200-kb region contained about 30 annotated genes, 
Figure 2 | Homozygosity mapping of the shell trait. a, AVROS pedigree
palms heterozygous (tenera, pink) and homozygous (pisifera, blue and green) 
for SHELL are indicated, linked to parents and progeny planted over five 
decades. Individually sequenced palms are indicated (green). Three additional 
pisifera palms from this extended pedigree were also individually sequenced 
(not shown). The remaining pisifera palms were sequenced as a pool. b, The 
SHELL locus was mapped onto scaffold p3-sc00043 (3.4 Mb), and SNP density 
was plotted along the scaffold (top). Markers surrounding SHELL are plotted 
only five of which were fully homozygous, and only one of which lay in
the genetic interval containing SHELL (Fig. 2c). This gene encodes a 
homologue of SEEDSTICK, which is responsible for ovule and seed 
development in Arabidopsis'*"’” (Fig. 2c and Supplementary Fig. 2). 
PCR amplicon sequencing (Methods) identified allelic differences 
between SHELL in Deli dura (ShDeliDura), and in the AVROS (shAVROS)
and T128 (shMPOB) pisifera haplotypes derived from Congo and Nigeria,
respectively (Fig. 3). A nucleotide substitution in shMPOB results in a
leucine-to-proline amino acid change in the conserved DNA binding
and dimerization domain, whereas a substitution in shAVROS results in
a lysine-to-asparagine amino acid change in the same domain, only
two amino acids carboxy-terminal to the shMPOB change (Fig. 3). In
related proteins, this highly conserved lysine residue is involved in 


[Figure 2 graphic not reproduced. Recoverable labels: a, year of planting
(from Indonesia; early 1950s; 1957; 1980s; late 1990s); b, c, position on
scaffold 43 (bp), with SHELL marked.]

against the percentage of pisifera palms with the SNP haplotype in the Nigerian 
T128 F1 mapping population. SHELL was mapped between flanking markers,
denoted by blue diamonds, at 400 kb and 1 Mb. A local diversity minimum of 
200 kb was found within this interval (inset). c, Genes (green) and transposons 
(red) are indicated as boxes. The local diversity minimum contained four 
homozygous genes (boxed), only one of which (SHELL) was located between 
the flanking markers in b. 
[Figure 3 alignment not reproduced. Recoverable content: residues 1-58 of the
A. thaliana STK/AGL11 reference sequence,
GRGKIEIKRIENSTNRQVTFCKRRNGLLKKAYELSVLCDAEVALIVFSTRGRLYEYAN, followed by rows
for the out groups, the oil palm dura allele, and palms from Nigeria
(FK7/005-T128, GGN23), Congo (AVROS) and Tanzania (0.256/238).]
Tanzania (0.256/238).- int eane eee e re Bi dese SoM RRR ee 


Figure 3 | Sequence diversity of SHELL. Multiple sequence alignment of 
Arabidopsis STK (AEE82818.1) and SHP1 (P29381.1) proteins, rice protein 
OsMADS13 (AAF13594.1), poplar predicted protein (XP_002327282.1), grape 
MADS-box protein 5 (XP_002281890.1), tomato TAGL1 (AAM33101.2) and 
peach SHATTERPROOF-like (ABG75908.1) (out groups); Elaeis oleifera, 
Elaeis guineensis and exemplars of Deli dura (oil palm, dura allele); and pisifera 


nuclear localization, and direct DNA binding, whereas the substitution
by a proline only two amino acid residues amino-terminal to this
position would disrupt the α-helix that is involved in MADS dimerization
and DNA binding. Analysis of an additional 336 palms was used to
validate these alleles within established phenotyping norms (Methods).
These included four pisifera palms from introgression trials of shMPOB
into tenera carrying the shAVROS allele. Sequencing confirmed that
these four palms were heteroallelic, indicating that the two alleles failed
to complement, and confirming the identity of the gene (Methods). To
explore segregation in E. guineensis populations further, the SHELL
exon 1 sequence was generated from a diversity panel of 379 palms
representing nine distinct wild oil palm populations collected from
Angola, Madagascar, Nigeria and Tanzania, a subset of a 110,000-accession
seed bank collected over the past five decades (Methods).
We found that all palms evaluated carried either the ShDeliDura, shAVROS
or shMPOB alleles in exon 1 (Fig. 3 and Supplementary Fig. 3).

Analysis of 10,916,126 RNA-seq reads from 22 different libraries 
revealed only 159 reads matching SHELL, all of which were found in 
just four libraries: from whole florets one day after anthesis, kernels 10 
and 15 weeks, and mesocarp 15 weeks after anthesis (WAA). In situ 
hybridization was performed on fruits between 1 and 5 WAA (Fig. 4), 
at the earliest stages of shell formation. Uniform but weak hybridization
signals were detected in the mesocarp of both the thick-shelled 
(dura) and shell-less (pisifera) fruit forms (Fig. 4a, c), but very strong 
signals were detected in the outer layers of the developing kernel only 
in the dura form (Fig. 4a), consistent with the function of SHELL. The 
oil palm shell is heavily lignified (Fig. 1b), forming the endocarp, and 
surrounds the kernel (or pit). It is not found in pisifera palms, which 
are often female sterile. 

The function and expression of the SHELL gene are conserved in higher 
plants. In Arabidopsis thaliana, SHATTERPROOF (SHP1) and STK 
are type II MADS-box proteins of the C and D class, respectively, and 
form a network of transcription factors that control differentiation of 
the ovule, seed and lignified endocarp. In tomato, SHP1 homologues 
from Nigerian T128 and Congo AVROS (oil palm, pisifera allele). The pisifera 
fruit form is caused by two disruptive SNPs that affect a highly conserved amino 
acid motif in the MADS-box DNA binding and dimerization domain. Gene 
names (out group examples) or sample identifiers (oil palm examples) are in 
parentheses. 


control fleshy fruit expansion in the endocarp, and in peaches, which 
are also drupes, homologues of both genes have been implicated in 
lignified split-pit formation. In rice, the SHELL orthologue is 
OsMADS13 (Supplementary Fig. 4), a homologue of Arabidopsis thaliana 
STK and SHP1 (ref. 26) that controls ovule identity, so that mutants 
are female sterile. STK and SHP1 bind to DNA as heteromultimers 


[Figure 4 panel labels: SHELL probe; negative control probe; WAA.]


Figure 4 | Expression of SHELL in early oil palm fruit. a-d, Longitudinal 
sections of 1 WAA dura fruit (a, b) and cross sections of 5 WAA pisifera fruit 
(c, d) were hybridized to a pool of two unique SHELL locked nucleic acid (LNA) 
probes (a, c) or to a scrambled sequence LNA negative control probe 
(b, d). Colourimetric in situ hybridization probe detection is blue, and all 
sections were counterstained red. Structural hallmarks of fruits are indicated by 
labelled arrows in a and c. Scale bar, 0.8 mm. 
Table 1 | SHELL mutations and protein interactions 

Activation domain fusion    Binding domain fusion    Interaction 
ShDeliDura                  ShDeliDura               - 
shMPOB                      ShDeliDura               - 
shAVROS                     ShDeliDura               - 
OsMADS24                    OsMADS24                 + 
OsMADS24                    ShDeliDura               + 
OsMADS24                    shMPOB                   - 
OsMADS24                    shAVROS                  + 

Pisifera fruit form mutations disrupt the interaction of SHELL and OsMADS24. ‘+’ denotes dimerization 
detected in yeast two-hybrid assay; ‘-’ denotes no interaction detected. Experiments were performed 
as described (Methods). Complete data sets are provided in Supplementary Fig. 5 and Supplementary 
Table 1. 


with SEPALLATA (SEP) MADS-box proteins, including OsMADS24 
in rice, and the highly conserved MADS domain is involved in both 
DNA binding and dimerization. We postulated that the mutations 
we detected in SHELL could account for the remarkable single-gene 
heterosis exhibited in tenera palms if they disrupted heterodimerization, 
as well as DNA binding. In yeast two-hybrid assays, we 
found that the ShDeliDura and shAVROS allelic forms of SHELL, but 
not the shMPOB form, interacted as a heterodimer with OsMADS24. 
Furthermore, OsMADS24 homodimerized, but SHELL 
forms did not homodimerize with each other (Table 1, Supplementary 
Fig. 5 and Supplementary Table 1). If productive heterodimers 
compete with non-productive dimers in heterozygotes, this would 
neatly account for hybrid vigour according to theoretical models. 
Overdominance at a single locus accounts for similarly remarkable 
increases in hybrid yield in tomato. 

The unravelling of the genetic basis for the shell-less phenotype 
paves the way for designing molecular strategies for genotyping trees 
that breed true for the phenotype, and modulating SHELL activity for 
desired fruit forms. A marker for SHELL could be used by seed pro- 
ducers to reduce or eliminate dura contamination, and to distinguish 
the dura, tenera and pisifera plants in the nursery long before they are 
field planted; the advantage here is that they could be planted 
separately based on the shell trait. This is useful as the pisifera palms 
have vigorous vegetative growth, and planting them in high density 
encourages male inflorescence development and pollen production. 
As foreseen by the breeder A. Devuyst in the Belgian Congo (in the 
same year that the structure of DNA was solved), accurate genotyping 
for enhanced oil yields will optimize and ultimately reduce the acreage 
devoted to oil palm plantations, providing an opportunity for conservation 
and restoration of dwindling rainforest reserves. 


METHODS SUMMARY 


Mapping population. The mapping population was derived from the controlled 
self-pollination of the high iodine value (a measure of the level of unsaturation) 
tenera palm (T128) from the Malaysian Palm Oil Board (MPOB) Nigerian germ- 
plasm collection. Ripe fruit bunches were used for phenotyping the three fruit 
forms. DNA from young leaves was purified, and RFLP and SSR analyses were 
performed as described. SNPs were interrogated using the Infinium iSelect II assay 
and GenomeStudio v2010.3 (Illumina). Map construction was carried out with 
JoinMap v4. SNP markers from the genetic map were compared to the draft 
reference pisifera genome sequence, and highest probability SNPs were located 
on scaffold 43 of build 3 (p3-sc00043). Additional scaffold 43 SNPs were designed 
and evaluated in the mapping population using the MassArray iPlex assay 
(Sequenom). 

Homozygosity mapping. Individual palms from the AVROS pedigree were obtained 
from MPOB, Kulim and Sime Darby plantations. The genomes of 14 AVROS 
pisifera palms were independently sequenced, and 29 additional AVROS pisifera 
palms were sequenced as a pool using the HiSeq 2000 instrument (Illumina). 
Individual reads from each tree and the pool were mapped to scaffold 43 from 
the reference genome assembly, and SNPs were identified and summed over 10-kb 
intervals. This scaffold was annotated for genes, and those with the highest homo- 
zygosity were screened for amino acid changes between pisifera and dura. 
SHELL sequencing. PCR amplicons of SHELL exon 1 were sequenced twice each 
in the forward and reverse directions, using the ABI3730 instrument (Life 
Technologies) (Methods). 
SHELL in situ hybridization. Longitudinal and cross sections were hybridized 
with locked nucleic acid digoxigenin-labelled oligonucleotide probes. Slides were 
developed with alkaline phosphatase and counterstained with Nuclear Fast Red 
(Methods). 

Yeast two-hybrid assays. Oil palm ShDeliDura, shMPOB, shAVROS and rice OsMADS24 
sequences were each synthesized as two overlapping gBlock gene fragments 
(Integrated DNA Technologies), combined using Gibson Assembly Master Mix 
(NEB), and ligated in-frame into MatchMaker Gold Yeast Two-Hybrid System 
vectors (Clontech). Constructs included the coding sequences of the MADS-box, 
I and K domains of each protein (Methods). Clones were sequence verified by 
Sanger sequencing (ABI). Yeast (AH109) co-transformations and interaction 
assays were performed following MatchMaker Gold protocols (Clontech). 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Plant materials and germplasm collection. Oil palm germplasm materials used 
in this study were collected through bilateral agreements between Malaysia and the 
countries of origin, and were in accordance with the Convention on Biological 
Diversity 1992. The reference Deli dura palm was Malaysian Palm Oil Board (MPOB) 
0.212/70, the reference pisifera palm was MPOB 0.182/77, and the reference E. oleifera 
palm was MPOB 0.211/2460. The mapping family used for generating the genetic 
linkage map was derived from controlled self-pollination of the high iodine value 
virescens tenera palm, T128 (accession number MPOB 371), from the MPOB 
Nigerian germplasm collection. A total of 241 palms were originally planted from 
1993 to 1997 at several locations in Malaysia, namely the MPOB-UKM Research 
Station Bangi at Selangor, MPOB Ulu Paka Research Station at Terengganu, 
MPOB Keratong Research Station at Pahang, MPOB Lahad Datu Research 
Station at Sabah, United Plantations at Perak and the FELDA Research Station 
at Pahang. Of the 241 palms, 240 were still available for marker analysis. The T128 
palm is relevant to the breeding programme as it has outstanding attributes in 
terms of oil yield, fatty acid composition and low height increment. It has been 
crossed with at least six dura, eight tenera and three pisifera palms for progeny 
testing and widening the genetic base. A collection of 379 palms (287 dura, 86 
tenera and 6 pisifera scored fruit forms) from Angola, Congo, Madagascar, Nigeria 
and Tanzania were taken from the diversity panel maintained by MPOB, and 
screened for new alleles as described. Unopened leaf samples (spear leaves) were 
collected from individual palms and immediately frozen in liquid nitrogen and 
stored at -80 °C until DNA preparation. DNA was extracted and purified from 
the leaf samples as previously described. 

Whole-genome sequencing and assembly. An AVROS pisifera palm was sequenced 
to high coverage on the 454 GS FLX Titanium XL+ next-generation sequencing 
platform (454/Roche). Sequence reads were generated from DNA fragment libraries 
and from a complex series of linker libraries, in which read ends span fragment 
sizes ranging from 0.75 to >20 kb. Sequence reads were assembled with the 
Newbler assembler (Roche 454), producing a reference sequence of the oil palm 
genome. Scaffolds from the reference assembly containing markers genetically 
mapped in the SHELL interval were identified. A BAC physical map was con- 
structed from a ten-fold BAC library constructed from the same AVROS pisifera 
used to generate the reference sequence. BAC end sequences were also generated 
from each BAC in the library using standard Sanger sequencing on the 3730 
sequencing platform (Life Technologies). BAC end sequences were assembled into 
the reference genome with the Newbler assembler. A minimum tiling path of BAC 
clones spanning the SHELL interval was selected, and BAC clones in the tiling path 
were sequenced in pools to high coverage with 454 XL technology. The BAC pool 
sequences were assembled and merged with the original 454 whole-genome 
shotgun sequence data and all BAC end sequences. This improved both scaffold 
coverage and scaffold length across the SHELL interval. 

Genetic mapping. A total of 241 progeny palms were derived from self-pollination 
of the Nigerian tenera palm T128 (ref. 4), of which 240 palms were available 
throughout the study. Two palms could not be phenotyped accurately, whereas 
124 tenera, 46 pisifera and 68 dura palms were phenotyped with high confidence at 
the time of genetic map construction. All 240 progeny were scored for 4,451 SNP 
markers using the Infinium iSelect Assay (Illumina), as well as 3 RFLP markers, 4 
SSR markers and 193 additional SSR markers developed from the oil palm genome 
sequence. The linkage map was then constructed using JoinMap 4.0 
(http://www.kyazma.nl/index.php/mc.JoinMap). The genotype data were formatted as 
required for mapping according to an F2 population. Markers showing a segregation 
profile of 1:2:1 were used in the map construction. 
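
As an illustration of the 1:2:1 criterion, a chi-square goodness-of-fit test with two degrees of freedom can be sketched as follows. This is a minimal sketch and not JoinMap's implementation; the function name `chi_square_121` is hypothetical. The sample input is the set of fruit-form calls reported above (68 dura, 124 tenera, 46 pisifera), used on the assumption that a co-dominant locus in the selfed progeny should segregate 1:2:1.

```python
import math


def chi_square_121(n_aa, n_ab, n_bb):
    """Return (chi2, p) for a goodness-of-fit test against a 1:2:1 ratio."""
    total = n_aa + n_ab + n_bb
    observed = (n_aa, n_ab, n_bb)
    expected = (total / 4, total / 2, total / 4)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # With 2 degrees of freedom the chi-square survival function is exp(-x / 2).
    p_value = math.exp(-chi2 / 2)
    return chi2, p_value


# Fruit-form counts from the mapping family: dura, tenera, pisifera.
chi2, p = chi_square_121(68, 124, 46)
print(f"chi2 = {chi2:.2f}, P = {p:.3f}")
```

A marker would be flagged as severely distorted under the threshold quoted in the text only if P fell below 0.0001.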

Two sets of the genotype data were then created, in which one set is the converse 
of the other to account for phase differences in the T128 ‘selfed’ F, population. 
Markers that exhibited severe distortion (P < 0.0001) and markers having more 
than 10% missing data were excluded. Both sets of genotype data were then 
grouped at a recombination frequency of ≤ 0.2. Next, 18 nodes were selected to 
create 18 initial groups for calculating the linkage groups. The linkages were 
calculated and loci ordered based on the maximum likelihood algorithm. None 
of the markers showed severe distortion (P < 0.0001). The few distorted markers 
observed were significant at P<0.05-0.1. These markers were removed from 
further analysis when necessary. Markers exhibiting nearest neighbour stress value 
> 2 (cM) were identified and excluded from the analysis. Markers contributing to 
insufficient linkages were also determined and removed. The T128 co-dominant 
map constructed consisted of 16 groups, and SHELL was placed on linkage group 
7, within the expected region (Supplementary Fig. 1). 

The SNP markers surrounding SHELL were subsequently mapped by sequence 
similarity to a 3.4-Mb assembly scaffold 43 (p3-sc00043). A tiling path of BAC 
contigs corresponding to scaffold 43 was selected from a high-information content 
physical map of pisifera and sequenced. An additional 50 SNP assays were 
designed from an improved assembly (p5-sc00060) corresponding to scaffold 
43. Thirty additional SNP markers were also designed from scaffolds p3-sc00191, 
p3-sc00203 and p3-sc02216, which were also associated with markers 
on linkage group 7, and reassembled as p5-sc00263. These 80 SNP markers 
(designated as SNPE) were genotyped in the T128 selfed population using the 
Sequenom MassArray iPlex platform, and 63 were polymorphic giving a final 
co-dominant genetic linkage map consisting of 818 markers (719 SNP (inclusive 
of SNPE), 96 SSR, 2 RFLP markers and SHELL) in 16 linkage groups. SHELL 
remained in linkage group 7, together with the 63 SNPE markers developed from 
the selected scaffolds. The final size of linkage group 7 is approximately 182 cM, 
with an average of 1.2 cM between two adjacent markers (Supplementary Fig. 1). 

Single marker mapping was used to determine the recombination breakpoints 
surrounding SHELL in each individual palm. To obtain a genetic interval contain- 
ing SHELL, palms with multiple breakpoints in the region and/or inconsistent 
haplotypes were discarded in case of mis-genotyping. Markers flanking SHELL 
were selected using these conservative criteria and found to lie 450kb apart on 
scaffold p3-sc00043 (Fig. 2b, c). 

Fruit form phenotyping. Oil palm trees were grown to maturity in open planta- 
tions, making accurate phenotyping of fruit form for some samples difficult owing 
to variation in fertility, yield and environment. Several fruits were harvested from 
each palm, and shell thickness and fruit form were determined using established 
criteria. To assess the accuracy of the fruit form phenotypic data used in the study, 
we reviewed 460 phenotype calls that were made between 2003 and 2012. In this 
period, up to three independent attempts were made to visually determine the fruit 
form phenotypes of 340 palms, 241 of which were from the T128 selfed population 
used to map SHELL, and 99 from a different population for which re-phenotyping 
data were available. In the data, ambiguous calls were made 26 times (or 5.7% of 
total phenotype determinations) where breeders were unsure of the fruit form 
phenotype. 

Homozygosity mapping. A total of 43 individual pisifera palms from the AVROS 
pedigree (originating from Congo) were sourced from the MPOB, Sime Darby and 
Kulim plantations. Fourteen of the AVROS pisifera palms were independently 
sequenced, whereas the DNA from the remaining 29 palms was pooled for sequen- 
cing (pool 1). Whole-genome shotgun sequence data was generated on the HiSeq 
2000 (Illumina). Individual trees and pools of trees were sequenced to 20X 
and 40X raw sequence coverage, respectively. Individual reads were mapped to 
sequence scaffolds from the pisifera reference genome assembly, and highest 
probability SNPs were located on scaffold 43 of the pisifera build 3 (p3-sc00043). 
In each of the 14 genome sequences, SNPs were summed over 10-kb windows along 
the scaffold and plotted against map location. This scaffold was computationally 
annotated for genes by comparing to public databases for the A. thaliana and rice 
genomes using the NCBI BLAST similarity searching tool. The genes with the 
highest homozygosity within the predicted interval from the genetic map were 
screened for putative function and for nucleic and amino acid changes between 
the pisifera and dura lines, as well as for amino acid differences in other species. 
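
The windowed SNP summation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the pipeline used in the study: SNP positions are taken as 0-based scaffold coordinates, the function names are hypothetical, and the toy positions are invented for the example.

```python
from collections import Counter

WINDOW = 10_000  # 10-kb windows, as in the text


def snps_per_window(positions, scaffold_length, window=WINDOW):
    """Count SNPs in consecutive fixed-size windows along one scaffold."""
    counts = Counter(pos // window for pos in positions)
    n_windows = -(-scaffold_length // window)  # ceiling division
    return [counts.get(i, 0) for i in range(n_windows)]


def lowest_diversity_window(per_palm_counts):
    """Sum window counts across palms; return (index of minimum, summed counts)."""
    summed = [sum(column) for column in zip(*per_palm_counts)]
    best = min(range(len(summed)), key=summed.__getitem__)
    return best, summed


# Toy data: two palms on a hypothetical 30-kb scaffold.
palm_a = snps_per_window([500, 12_000, 25_000, 25_500], 30_000)
palm_b = snps_per_window([700, 900, 27_000], 30_000)
index, totals = lowest_diversity_window([palm_a, palm_b])
print(index, totals)
```

Plotting the summed counts against window start coordinates gives the kind of diversity profile in which a local minimum marks a candidate region of shared homozygosity.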
Validation and genotyping. DNA sequencing of the eight exons of the SHELL 
gene was carried out for all palms of the mapping family as well as the AVROS 
pisifera palms used in homozygosity mapping. One of the 14 sequenced pisifera 
palms (TP10) proved to be heterozygous for the shAVROS and shMPOB haplotypes, 
due to contamination in a breeding trial, and was not used for the homozygosity 
analysis. PCR primers were designed based on the reference pisifera genome 
sequence to amplify the entirety of SHELL exon 1. SHELL-specific primer 
sequences were 5'-TTGCTTTTAATTTTGCTTGAATACC-3’ (forward primer 
upstream of exon 1) and 5’-TTTGGATCAGGGATAAAAGGGAAGC-3’ (reverse 
primer downstream of exon 1). Primer sequences were confirmed to be unique in 
the reference pisifera genome, and to avoid any identified polymorphic nucleotides. 
A 5' M13 forward sequence tag (5'-GTTTTCCCAGTCACGACGTTGTA-3') 
was added to the exon 1 forward PCR primer. A 5’ M13 reverse sequence tag 
(5'-AGGAAACAGCTATGACCAT-3’) was added to the exon 1 reverse primer. 
SHELL exon 1 was amplified from genomic DNA and PCR amplification was 
performed using 20 ng of purified genomic DNA under standard PCR amplifica- 
tion conditions. Amplicons were treated with exonuclease I and shrimp alkaline 
phosphatase to remove unincorporated primers and deoxynucleotides. An aliquot 
of each amplicon was sequenced using a M13 forward primer on an ABI3730 
instrument under standard conditions. Each amplicon was sequenced twice in 
the forward direction. An aliquot of each amplicon was additionally sequenced 
using the M13 reverse primer. Each amplicon was sequenced twice in the reverse 
direction. All sequencing data were aligned to the reference pisifera genome 
sequence. Data were analysed to determine the genotype at each of the two SNP 
positions identified to be associated with the pisifera fruit form. 
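
The construction of the M13-tailed amplification primers can be sketched as follows. The sequences are those given above, with the M13 forward tag taken as the standard GTTTTCCCAGTCACGACGTTGTA; the helper names `tail_primer` and `gc_fraction` are hypothetical and are included only for illustration.

```python
M13_FORWARD_TAG = "GTTTTCCCAGTCACGACGTTGTA"
M13_REVERSE_TAG = "AGGAAACAGCTATGACCAT"
EXON1_FORWARD = "TTGCTTTTAATTTTGCTTGAATACC"
EXON1_REVERSE = "TTTGGATCAGGGATAAAAGGGAAGC"


def tail_primer(tag, primer):
    """Prepend a 5' sequencing tag to a gene-specific primer (both written 5'->3')."""
    for name, seq in (("tag", tag), ("primer", primer)):
        if set(seq) - set("ACGT"):
            raise ValueError(f"{name} contains non-ACGT characters: {seq}")
    return tag + primer


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


tailed_forward = tail_primer(M13_FORWARD_TAG, EXON1_FORWARD)
tailed_reverse = tail_primer(M13_REVERSE_TAG, EXON1_REVERSE)
print(len(tailed_forward), len(tailed_reverse))
print(round(gc_fraction(EXON1_FORWARD), 2))
```

Because every amplicon then carries the same two tags, a single pair of M13 sequencing primers serves for all samples.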

We sequenced SHELL exon 1 from 336 individual palms from the T128 mapping 
population, the samples used to construct homozygosity maps, and a collection of 
palms in crosses with advanced lines (100 pisifera, 148 tenera, 86 dura and 2 with 
ambiguous phenotype). A total of 323 (96.7%) had SHELL genotypes concordant 
with their phenotype, and 11 (3.3%) had discordant phenotypes, reflecting the 
accuracy of phenotyping in the plantation (see above). SHELL exon 1 was also 
sequenced from all four pisifera palms derived from T × T crosses between shMPOB 
and shAVROS (FELDA AA and MPOB PK575), and proved to be heteroallelic as 
predicted. A further three pisifera palms were also found to be heteroallelic, includ- 
ing TP10 which was sequenced completely and proved to be a contaminant in the 
AVROS pedigree (see above). The other two heteroallelic palms were probably 
similarly contaminated. 
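
The concordance percentages can be reproduced with a short calculation. This sketch assumes that the two palms with ambiguous phenotype are excluded from the denominator, which is our reading of the figures rather than something stated explicitly; the variable names are illustrative.

```python
# Phenotype classes of the 336 sequenced palms, as listed in the text.
counts = {"pisifera": 100, "tenera": 148, "dura": 86, "ambiguous": 2}

total = sum(counts.values())
scored = total - counts["ambiguous"]  # palms with an unambiguous phenotype call

concordant, discordant = 323, 11
if concordant + discordant != scored:
    raise ValueError("concordant and discordant palms should sum to the scored palms")


def percent(n, denominator):
    """Percentage rounded to one decimal place."""
    return round(100 * n / denominator, 1)


print(total, scored)
print(percent(concordant, scored), percent(discordant, scored))
```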

A second attempt was made to phenotype and resequence the 11 apparently 
discordant trees, enabling the re-evaluation of nine trees for fruit form. The second 
phenotype call of one palm was concordant with the genotype prediction, whereas 
the second phenotype calls of seven palms were ambiguous, and one palm retained 
the original phenotype. This palm was re-genotyped and proved to have a con- 
sistent genotype. It is plausible that the nine palms remaining (or 2.7% of the 
genotyped population) had been misphenotyped originally, given that fruit form 
phenotyping error is believed to be in excess of 5% (see above), highlighting the 
need for a molecular assay that more accurately predicts fruit form. This assump- 
tion was confirmed in six of the nine palms for which haplotypes were available, as 
haplotypes were consistent with genotype not phenotype, ruling out recombina- 
tion as an explanation for discrepancy. The map expansion immediately around 
the SHELL gene is similarly explained by misphenotyped palms. That is, four SNP 
markers flanking the SHELL gene have 9 recombinants (of the 238 palms), which 
include 6 of the 9 misphenotyped palms from sequencing of SHELL exon 1. 
In situ hybridization. Tissue sections (5 µm) were mounted on Superfrost slides, 
dried overnight, and baked at 60°C. Sections were deparaffinized in xylene, 
immersed in 100% ethanol and dried. The sections received light treatment with 
protease. Two LNA probes to the target messenger RNA were designed and 
supplied by Exiqon (5'-DIG-ATTAACAAGCAGCGACATACTT-3' and 
5'-DIG-TTGATGGTGTGAATAGTGTTGT-3'). A Scramble-miR negative control 
LNA probe was also provided by Exiqon (5'-DIG-GTGTAACACGTCTATACGCCCA-3'). 
Optimized probe cocktail solution in Exiqon hybridization buffer was 
placed on the tissue sections. The sections were covered with polypropylene cover- 
slips and heated to 60 °C for 5 min followed by hybridization at 37 °C overnight. 
Sections were washed in high stringency solution (0.2× SSC with 2% BSA) at 60 °C 
for 10 min. The LNA probes were detected using alkaline phosphatase conjugate 
(NBT/BCIP, blue precipitate). Sections were counterstained with Nuclear Fast 
Red. Sections were rinsed and mounted with coverslips. Aperio scans of the slides 
were made and images were extracted with ImageScope software. 
Yeast two-hybrid assays. The coding sequences for oil palm ShDeliDura, shMPOB, 
shAVROS and rice OsMADS24 were synthesized as two approximately 300-bp 
gBlocks each that overlapped by 30 bp (Integrated DNA Technologies). Gibson 
assembly of the two fragments was performed using kit manufacturer’s protocols 
(NEB). EcoRI and BamHI sites were added to the gBlock sequences for simple 
ligation into MatchMaker Gold Yeast Two-Hybrid vectors. Each sequence was 
cloned into both the binding domain vector, pGBKT7, and the activation domain 
vector, pGADT7. SHELL sequences encoded amino acids 2-175, including the 
entire MADS-box, I and K domains. The C domain was excluded from yeast two- 
hybrid constructs to avoid auto-activation of selection genes in the yeast two- 
hybrid system. The ShDeliDura peptide sequence encoded by the vectors was: 
GRGKIEIKRIENTTSRQVTFCKRRNGLLKKAYELSVLCDAEVALIVFSSRGRL 
YEYANNSIRSTIDRYKKACANSSNSGATIEINSQQYYQQESAKLRHQIQILQN 
ANRHLMGEALSTLTVKELKQLENRLERGITRIRSKKHELLFAEIEYMQKREV 
ELQNDNMYLRAKIAEN. The shMPOB peptide sequence encoded by the vectors 
was identical to the above sequence, with the exception that the underlined Leu 
residue (L) was converted to Pro (P). The shAVROS peptide sequence encoded by 
the vectors was identical to the above sequence, with the exception that the underlined 
Lys residue (K) was converted to Asn (N). OsMADS24 sequences encoded 
amino acids 2-177, including the entire MADS-box, I and K domains, but 
excluding the C domain. The OsMADS24 sequence encoded by the vectors was: 
GRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGK 
LYEFCSGQSMTRTLERYQKFSYGGPDTAIQNKENELVQSSRNEYLKLKARV 
ENLQRTQRNLLGEDLGTLGIKELEQLEKQLDSSLRHIRSTRTQHMLDQLTD 
LQRREQMLCEANKCLRRELEES. Auto-activation control tests were performed 
by transforming each fusion vector into yeast alone, and each vector showed 
no auto-activation of selection reporter genes. Co-transformations were per- 
formed for all 16 pairwise combinations of BD and AD vectors and scored for 
growth on SD-Leu-Trp, SD-Leu-Trp-His, SD-Leu-Trp-His-Ade and X-gal media 
plates. Positive interactions were scored as blue co-transformants (on X-gal plate) 
that were able to grow on SD-Leu-Trp-His-Ade selection plates (Supplementary 
Fig. 5). 
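
As a consistency check on the constructs described above, the following sketch compares the length of each listed peptide with its stated residue range and enumerates the 16 binding-domain and activation-domain pairings. It is an illustration only: the variable and function names are hypothetical, and the scoring helper simply restates the rule given in the text (blue on X-gal and growth on SD-Leu-Trp-His-Ade).

```python
from itertools import product

SH_DELI_DURA = (
    "GRGKIEIKRIENTTSRQVTFCKRRNGLLKKAYELSVLCDAEVALIVFSSRGRL"
    "YEYANNSIRSTIDRYKKACANSSNSGATIEINSQQYYQQESAKLRHQIQILQN"
    "ANRHLMGEALSTLTVKELKQLENRLERGITRIRSKKHELLFAEIEYMQKREV"
    "ELQNDNMYLRAKIAEN"
)
OS_MADS24 = (
    "GRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGK"
    "LYEFCSGQSMTRTLERYQKFSYGGPDTAIQNKENELVQSSRNEYLKLKARV"
    "ENLQRTQRNLLGEDLGTLGIKELEQLEKQLDSSLRHIRSTRTQHMLDQLTD"
    "LQRREQMLCEANKCLRRELEES"
)


def span_length(first, last):
    """Number of residues in an inclusive amino acid range."""
    return last - first + 1


def is_positive(grows_on_quadruple_dropout, blue_on_xgal):
    """Scoring rule from the text: both reporter readouts must be positive."""
    return grows_on_quadruple_dropout and blue_on_xgal


CONSTRUCTS = ("ShDeliDura", "shMPOB", "shAVROS", "OsMADS24")
pairs = list(product(CONSTRUCTS, repeat=2))  # (BD fusion, AD fusion)

print(len(SH_DELI_DURA), span_length(2, 175))
print(len(OS_MADS24), span_length(2, 177))
print(len(pairs))
```

Both peptide lengths agree with the residue ranges quoted in the text (amino acids 2-175 and 2-177).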
Maternal imprinting at the H19-Igf2 locus maintains 
adult haematopoietic stem cell quiescence 


Aparna Venkatraman, Xi C. He, Joanne L. Thorvaldsen, Ryohichi Sugimura, John M. Perry, Fang Tao, Meng Zhao, 
Matthew K. Christenson, Rebeca Sanchez, Jaclyn Y. Yu, Lai Peng, Jeffrey S. Haug, Ariel Paulson, Hua Li, Xiao-bo Zhong, 
Thomas L. Clemens, Marisa S. Bartolomei & Linheng Li 


The epigenetic regulation of imprinted genes by monoallelic DNA 
methylation of either maternal or paternal alleles is critical for embryonic 
growth and development. Imprinted genes were recently shown 
to be expressed in mammalian adult stem cells to support self-renewal 
of neural and lung stem cells; however, a role for imprinting per se 
in adult stem cells remains elusive. Here we show upregulation of 
growth-restricting imprinted genes, including in the H19-Igf2 
locus, in long-term haematopoietic stem cells and their downregulation 
upon haematopoietic stem cell activation and proliferation. 
A differentially methylated region upstream of H19 (H19-DMR), 
serving as the imprinting control region, determines the reciprocal 
expression of H19 from the maternal allele and Igf2 from the paternal 
allele. In addition, H19 serves as a source of miR-675, which 
restricts Igf1r expression. We demonstrate that conditional deletion 
of the maternal but not the paternal H19-DMR reduces adult 
haematopoietic stem cell quiescence, a state required for long-term 
maintenance of haematopoietic stem cells, and compromises haematopoietic 
stem cell function. Maternal-specific H19-DMR deletion 
results in activation of the Igf2-Igf1r pathway, as shown by the 
translocation of phosphorylated FoxO3 (an inactive form) from 
nucleus to cytoplasm and the release of FoxO3-mediated cell cycle 
arrest, thus leading to increased activation, proliferation and eventual 
exhaustion of haematopoietic stem cells. Mechanistically, 
maternal-specific H19-DMR deletion leads to Igf2 upregulation 
and increased translation of Igf1r, which is normally suppressed 
by H19-derived miR-675. Similarly, genetic inactivation of Igf1r 
partly rescues the H19-DMR deletion phenotype. Our work establishes 
a new role for this unique form of epigenetic control at the 
H19-Igf2 locus in maintaining adult stem cells. 

Our earlier studies showed that imprinted genes, including those within 
the H19-Igf2 locus (Fig. 1a), are differentially expressed in haematopoietic 
stem and progenitor cells (HSPCs). To explore this further, we systematically 
analysed imprinted gene expression in quiescent-enriched long-term 
haematopoietic stem cells (LT-HSCs), more active short-term 
(ST)-HSCs and multipotent progenitor (MPP) populations (Fig. 1b). 
Out of 88 imprinted genes, 23 were differentially expressed in these 
populations. Of these 23, 15 were preferentially expressed in LT-HSCs, 
whereas the others were predominantly expressed in ST-HSCs and 
MPPs (Fig. 1c). Intriguingly, 80% of the imprinted genes with predominant 
expression in LT-HSCs were associated with growth restriction, 
including H19, Cdkn1c/p57, Ndn, Rb, Gtl2 and Grb10 (ref. 9). In contrast, 
imprinted genes expressed preferentially in ST-HSCs and MPPs, 
including Ascl2, Peg12, Sfmbt2, Pon3, Atp10a and Osbpl5, were associated 
with growth promotion and increased metabolism (see quantitative 
real-time PCR assay in Supplementary Fig. 1a). 

Given the critical role of H19 during embryonic development and its 
preferential expression in LT-HSCs, we considered that it plays a role 
in restricting LT-HSC activation. To test this idea, we conditionally 
deleted H19-DMR (an epigenetic regulator that controls expression of 
H19) by breeding H19fDMR/fDMR mice with Mx1-Cre mice to generate 
maternal (mH19ΔDMR/+) and paternal (pH19ΔDMR/+) allele-specific 
mutants (Supplementary Fig. 1b). The DMR region was deleted with 
100% efficiency in LT-HSCs (Supplementary Fig. 1c, e-g). As early as 
6 weeks, flow cytometric analysis showed a substantial decrease in 
frequency and absolute number of LT-HSCs in mH19ΔDMR/+ mice 
(Fig. 1d-f and Supplementary Fig. 1d), but not in pH19ΔDMR/+ mice 
(Fig. 1e and Supplementary Fig. 2a, b). Concurrently, we observed a 
significant increase in frequency and absolute number of ST-HSCs 
(Fig. 1d-f); however, the total number of bone marrow cells remained 
unchanged (Fig. 1g). By 6 months, both LT- and ST-HSCs were significantly 
decreased in frequency and absolute number, whereas bone 
marrow cellularity increased only in mH19ΔDMR/+ mice (Fig. 1d-g and 
Supplementary Fig. 2c, d). 

Cell cycle analysis of LT-HSCs at 6 weeks after pIpC induction 
showed a decrease in the G0 phase fraction and a concomitant increase 
in the G1 phase fraction in mH19ΔDMR/+ relative to control (Fig. 1h-j). 
We then tested the response of mH19ΔDMR/+ mice to bone marrow 
damage by administering 5-fluorouracil (5FU), which eliminates 
active HSPCs while sparing quiescent HSCs. Surviving quiescent 
HSCs later replenish lost HSPCs (Fig. 1k). In this context, a significant 
reduction in quiescent HSCs after three cycles of 5FU treatment 
led to deficient bone marrow recovery in the mH19ΔDMR/+ mutant compared 
with control (Fig. 1l, m and Supplementary Fig. 2f). Altogether, 
maternal but not paternal deletion of H19-DMR resulted in loss 
of HSC quiescence, leading to progressive loss of LT-HSCs and then 
ST-HSCs, accompanied by increasing progenitor cell proliferation and 
differentiation, thus ultimately increasing total bone marrow cellularity 
(Fig. 1g and Supplementary Figs 2e and 3a-d). 

To characterize the phenotype functionally, we transplanted equal 
numbers of sorted LT-HSCs from mutants and their control littermates. 
We observed a significant reduction in reconstitution ability for 
LT-HSCs derived from mH19ΔDMR/+ but not pH19ΔDMR/+ mutants 
compared with controls. Although overall engraftment was reduced in 
primary and secondary recipients, no mature lineage bias was apparent 
(Fig. 2a-f). Limiting dilution analysis to quantify functional HSCs 
showed a 2.5-fold decrease in mH19ΔDMR/+ mutant HSCs relative to 
control (Fig. 2d). Reciprocal transplantation of wild-type donor cells into 
either mH19ΔDMR/+ or control recipients did not result in alterations in 
haematopoiesis (Fig. 2g, h), indicating that an intrinsic change in the 
mH19ΔDMR/+ mutant HSCs was the primary cause of the phenotype. 

Next, we investigated whether H19-DMR controls the imprinted 
expression of H19 and Igf2 from the maternal and paternal alleles, 
respectively, in adult HSCs, as is observed in embryos''. Our RNA- 
seq analysis showed differential expression of H19 as well as Igf2 in 
Figure 1 | Defective LT-HSCs in mH19ΔDMR/+ mice. a, H19-Igf2 cluster.
Top: red box, maternally expressed; blue box, paternally expressed; grey boxes,
genes in the cluster. Bottom: unmethylated (open circles) and methylated (filled
circles) CpG dinucleotides; red cross, no expression; arrow, active transcription.
IR, imprinting region. b, Upper cartoon, hierarchical organization. Curved
arrow, self-renewal. Lower image: fluorescence-activated cell sorting (FACS)
plot. Left panel: gated on LSK; right panel: on CD34 and FLK2. c, Heat map of


HSCs (Fig. 3a, b). By crossing H19fl-DMR/+ females with Mus castaneus
(Cast) males, which enables parental allele discrimination by single
nucleotide polymorphism analysis, we further detected exclusive expression
of Igf2 from the paternal allele in HSCs (Fig. 3c). However, after deletion
of the maternal H19-DMR, we detected H19 downregulation
and Igf2 upregulation, which resulted from biallelic Igf2 expression
in HSCs (Fig. 3d-f). Igf2 was similarly upregulated in bone marrow,
including surrounding stromal cells, after maternal deletion of H19-DMR
(Fig. 3g, h). However, as shown by reciprocal transplantation, an
extrinsic increase of Igf2 expression alone is not sufficient to cause the
mH19ΔDMR/+ HSC phenotype. We next investigated whether the Igf2
signalling pathway is activated in mH19ΔDMR/+ LT-HSCs. Binding of
Igf2 to Igf1r activates signalling, whereas Igf2 binding to Igf2r attenuates
signalling. Igf2 and Igf1r amounts were gradually increased from
LT-HSCs to MPPs (Fig. 3i, j). However, messenger RNA (mRNA),
protein concentrations, and the number of Igf1r+ cells were significantly
increased in mH19ΔDMR/+ LT-HSCs compared with controls
(Fig. 3k-m), with no change in Igf2r expression (Fig. 3k). Igf2-Igf1r
signalling is known to activate PI3K-Akt, which phosphorylates
and inactivates FoxO3, a transcription factor that arrests the cell
cycle. Inactive pFoxO3 was detected in only 15% of normal
LT-HSCs but was substantially increased in ST-HSCs and MPPs
(Fig. 3n, o); however, in mH19ΔDMR/+ pFoxO3 was detected in 75% of
LT-HSCs. Our data indicate that H19-DMR deletion increased Igf2
re-population assay 16 weeks after transplantation in (a) sorted 100 LT-HSCs
(n = 10), (c) primary transplant (test dosage of 2 × 10° bone marrow cells)
(n = 10), (e) secondary transplant (n = 10), (g) reciprocal transplantation
(n = 4). d, Competitive re-population unit (CRU) content in each group of
mice transplanted at each dose (n = 60 mice in total). Horizontal dotted line,
37% of recipient mice failed to engraft; vertical dotted lines, various CRU
frequencies for each condition. Donor-derived lineage analysis after
(b) primary transplantation, (f) secondary transplantation, (h) reciprocal
transplantation. Error bars, s.e.m.


signalling, which released FoxO3-mediated suppression of HSC acti- 
vation and proliferation. 

In the placenta, H19 functions as a precursor of miR-675, which in
turn suppresses Igf1r. We next investigated whether this regulation
exists in adult HSCs. Expression of miR-675 was highest in LT-HSCs
in the control mice but was substantially reduced in mH19ΔDMR/+
LT-HSCs (Fig. 4a). To explore the potential role of miR-675 in Igf1r
regulation, we transplanted bone marrow cells overexpressing miR-675
into wild-type mice. Overexpression of miR-675 increased the
percentage of quiescent CD34− lineage− Sca-1+ cKit+ (LSK) cells but
did not significantly affect active CD34+ LSK cells (Fig. 4b). Western
Figure 3 | Activation of Igf2-FoxO3 signalling in mH19ΔDMR/+ mice.
a, b, RNA-seq analysis of Igf2 and H19 transcripts. FPKM, fragments per kilobase
of exon per million fragments mapped. c, d, Allele-specific expression of the
Igf2 transcript. Quantitative (q)RT-PCR 6 weeks after pIpC induction (n = 3)
for (e) H19, (f) Igf2, (g) stromal cells and (h) total bone marrow (n = 5).
i, j, Single-cell Igf1r and Igf2 staining from wild-type bone marrow cells and its
quantification (n = 4). k, Igf1r and Igf2r expression in sorted LT-HSCs. l, m, Igf1r
immunostaining and its quantification (n = 4). n, o, Single-cell phospho-FoxO1/
3/4 staining and its quantification (n = 4). Scale bar, 10 µm. Error bars, s.e.m.


blot analysis showed a significant reduction of Igf1r by miR-675 compared
with control (Fig. 4c). Furthermore, Igf1r amounts were lower in
CD34− LSK compared with CD34+ LSK cells in the control. However,
miR-675 overexpression significantly decreased Igf1r (Fig. 4c-e).
These data demonstrate that H19-derived miR-675 regulates Igf1r and
the corresponding quiescent state in HSCs.

To confirm further that H19-DMR controls Igf2-Igf1r signalling, we
crossed female H19fl-DMR/+ mice with male Mx1-Cre:Igf1rfl/fl mice
(Supplementary Fig. 4a). Although mH19ΔDMR/+ mutants (Fig. 3f, g)
showed a decrease in LT-HSCs and an increase in ST-HSCs and MPPs,
Igf1r−/− mutants showed an increase in LT-HSCs and a decrease in
MPPs. This indicates that Igf1r regulates the transitions from LT-HSCs
to ST-HSCs and further to MPPs. Interestingly, mH19ΔDMR/+ Igf1r−/−
Figure 4 | Igf1r regulation and rescue by genetic blockage of Igf2-Igf1r
signalling. a, miR-675 analysis by qRT-PCR (n = 3). b, Frequency of CD34 [...]
analysis of Igf1r in sorted bone marrow cells (positive and negative for green
fluorescent protein (GFP)). d, e, Single-cell immunostaining of Igf1r 8 weeks
after lentiviral infection and its quantification (n = 4). f, Representative FACS
plot with frequency (n = 4), (g) absolute number and (h) cell cycle analysis. i, Pan- [...]
RNA sequencing analysis in sorted stem cells: k, imprinted genes; l, m, Wnt and
BMP signalling; n, heat map of miRNA expression. Error bars, s.e.m.


double mutants showed a partial restoration of LT-HSC frequency
(Fig. 4f), whereas the transition from ST-HSCs to MPPs was still blocked
(Fig. 4f, g). This indicates that Igf2-Igf1r signalling is partly responsible
for the mH19ΔDMR/+ phenotype. An increase in ST-HSC frequency in
double-mutant mice was probably due to blocked transition from ST-HSC
to MPP by downregulation of Igf1r. Furthermore, cell cycle analysis
and pFoxO3 staining in double mutants showed partial rescue of the loss
of quiescence phenotype (Fig. 4h, i).

To demonstrate phenotypic rescue functionally in mH19ΔDMR/+
Igf1r−/− mice, we performed bone marrow transplantation assays.
Although mH19ΔDMR/+ mutants had significantly reduced engraftment
due to LT-HSC loss, engraftment of mH19ΔDMR/+ Igf1r−/− bone


348 | NATURE | VOL 500 | 15 AUGUST 2013 
©2013 Macmillan Publishers Limited. All rights reserved 


marrow cells increased to an amount between that of the mH19ΔDMR/+
and the Igf1r−/− single mutants, indicating a partial functional rescue
(Fig. 4j and Supplementary Fig. 4b). These results indicate that the
maternal H19-DMR controls Igf2-Igf1r signalling, which regulates
HSC state; however, the partial rescue indicates that deletion of H19-DMR
also affects other pathways required for LT-HSC maintenance.
To investigate this possibility, we performed RNA-seq analysis of
HSCs isolated from control, mH19ΔDMR/+ single, Igf1r−/− single and
mH19ΔDMR/+ Igf1r−/− double mutants. mH19ΔDMR/+ HSPCs showed
widespread alterations in expression of imprinted genes in all three
populations (Supplementary Figs 4c and 5). Genes involved in cell cycle
arrest (Cdkn1c), tumour suppression and stem cell maintenance
(Ndn, Gtl2) were downregulated in LT-HSCs. However, Igf1r−/−
LT-HSCs largely maintained expression patterns similar to control,
with high expression amounts of Cdkn1c, Ndn, Gtl2 and Dlk1. The
double mutants generally showed partial rescue of the alterations
observed in single mutants, indicating either compensation in gene
expression or the existence of a proposed imprinted gene network
(Fig. 4k and Supplementary Fig. 4c). Gene expression profiling of
non-imprinted genes showed many overlapping downstream genes and
microRNAs (miRNAs) that were abnormally expressed in single mutants
but partly rescued in double mutants (Fig. 4l-n and Supplementary Figs
6a-c and 7e). These included components of the Wnt and Tgf-β/BMP
pathways such as Smad4, Id2 and Fzd8 (refs 24-27) as well as Let-7, which
is known to repress cell proliferation and Igf signalling (Fig. 4l-n).
Interestingly, H19-DMR potentially controls other miRNAs, small nucleolar
RNAs (snoRNAs) and genes (Dusp26, P2rx2 and Gpr63) independent
of Igf2-Igf1r signalling (Supplementary Fig. 7a-f), with a
known function for Dusp26 in inhibiting cell proliferation. Taken
together, our data show that maternal H19-DMR primarily restricts
Igf2-Igf1r signalling, but also influences other genes and miRNAs
involved in maintaining HSC quiescence.

By studying the H19-DMR locus in an allele-specific manner, we
demonstrate that a specialized form of epigenetic control, genomic
imprinting, is critical to the maintenance of adult stem cells. This is
accomplished by maintaining LT-HSC quiescence, which can be attributed
largely to Igf2-Igf1r-dependent signalling, but also to extra
Igf2-Igf1r-independent effects on the regulation of cell cycle, proliferation
and growth.


METHODS SUMMARY 


Genotyping primers, flow assays and immunostaining were reported previously.
Lentivirus-mediated transfection used Magnetofection™ and ViroMag R/L particles
(OZ Biosciences). Microarray and RNA-seq analyses used Affymetrix MouseGenome430_2
arrays and Illumina TruSeq RNA Sample Prep Kit (catalogue number FC-122-1001),
respectively, followed by sequencing on an Illumina HiSeq 2000. Statistical
analyses (pairwise comparisons by Student’s t-test) were performed in
GraphPad Prism 5.
Full Methods and any associated references are available in the online version of
the paper.
METHODS


Animals. All mice used in this study were housed in the animal facility at the 
Stowers Institute for Medical Research and handled according to the guidelines 
of the Institute and the National Institutes of Health. All procedures were approved 
by the Institutional Animal Care and Use Committee of the Stowers Institute. H19- 
DMRIPMRAPMR mice on B6 background were provided by M. S. Bartolomei*’. 
Conditional mutant Igf1rfl/fl was provided by T. L. Clemens. Interferon-inducible
Mx1-Cre or tamoxifen-inducible Scl-Cre mouse strains were used to delete the
floxed H19-DMR and Igf1r. For Mx1-Cre activation, 250 µg of pIpC was injected
intraperitoneally every other day for 14 days at 5 weeks of age. For Scl-CreER
activation, 2 mg of tamoxifen dissolved in 0.1 ml of corn oil was injected
intraperitoneally every day for 5 days.

Single-cell HSC genotyping. Single CD34− Flk2− LSK cells were sorted into 96-well
plates (one cell per well) containing 50 µl MethoCult complete media (M3434;
Stem Cell Technologies) and incubated (37 °C, 5% CO2) for 12 days. Individual
colonies were collected separately, and DNA was purified using a QIAGENamp
DNA Blood Kit (Qiagen). PCR genotyping of H19 locus used G1, G5 and G7
primers described elsewhere.

Flow cytometry. Phenotypic analyses of haematopoietic cells collected from bone
marrow (femur and tibia) and peripheral blood were performed as described
previously. Cell sorting and analysis were performed with a MoFlo (Dako)
and/or CyAn ADP (Dako). We analysed data with FlowJo software (Ashland).

5FU treatment. Cohorts of mH19ΔDMR/+ and mH19fl-DMR/+ were injected with
5FU (Sigma-Aldrich) at 150 µg per g (body weight) 4 weeks after pIpC induction.
For one cycle, 5FU was injected once intravenously; for three cycles, 5FU was
injected three times at 4-week intervals. Bone marrow cells were analysed 10 days
after 5FU injections.

Cell cycle analysis. Cell cycle analysis of bone marrow LSKs was performed. Bone 
marrow cells (5 X 10°) were stained for LSKs, fixed overnight at 4°C in 4% 
paraformaldehyde and permeabilized with 0.2% Triton X-100. Cells were further 
stained with a BD Pharmingen™ FITC-conjugated-Mouse Anti-Human Ki67 Set 
(BD Pharmingen) according to the manufacturer’s instructions and 0.1 µg µl−1 DAPI.
This was followed by flow cytometric analysis with an InFlux Cell Sorter (BD 
Biosciences). 

Transplantation studies. For competitive re-population assays, 2X 10° bone 
marrow cells congenic with the host (CD45.1*) were included per mouse. One 
hundred sorted LT-HSCs from mH19ΔDMR/+ or mH19fl-DMR/+ were transplanted
intravenously into lethally irradiated (10 Gy) Ptprc (CD45.1) recipient mice. Mice 
were placed on Baytril water 3 days before irradiation, which continued for 2 weeks 
after irradiation. Each transplanted group consisted of eight to ten recipients. 
Donor-derived engraftment was assayed every 4 weeks after transplantation by 
collection of peripheral blood, red blood cell lysis and staining of CD45.1 (recipi- 
ent) versus CD45.2 (donor). Multi-lineage reconstitution was determined by CD3, 
B220 (for T and B lymphoid, respectively) and Gr1, Mac-1 (for myeloid) gating on 
donor (CD45.2+) cells. Limiting dilution experiments were performed by transplanting
three doses (200,000, 100,000 and 25,000) of test samples (n = 2) from
mH19ΔDMR/+ or mH19fl-DMR/+ (CD45.2) with a fixed number of 2 × 10° rescue
cells (CD45.2) into groups of ten lethally irradiated (10 Gy) recipient mice
(CD45.1). CRU frequency was determined with L-Calc software (Stem Cell
Technologies) on the basis of Poisson statistics. The plot was made on the basis
of the percentage of recipient mice containing 1% CD45.2 cells in the peripheral 
blood at 16 weeks after transplantation versus the number of cells injected per 
mouse. For secondary transplantation, the original, primary transplant recipients 
were euthanized; bone marrow was collected from the femur and then trans- 
planted mouse-to-mouse at a dosage of 1 X 10° cells per mouse. For reciprocal 
transplantation, wild-type congenic B6.SJL (CD45.1*) bone marrow cells (1 X 10° 
cells per recipient) were transplanted into lethally irradiated Mx1-Cre-induced
mH19fl-DMR/+ and mH19ΔDMR/+ (CD45.2+) recipients. Complete donor cell
engraftment by wild-type CD45.1+ cells was confirmed by flow cytometric analysis.
For rescue transplants, 2 × 10° (CD45.2) bone marrow cells from the
mH19fl-DMR/+ Igf1r−/− and mH19ΔDMR/+ Igf1r−/− mutants and the controls along
with 2 × 10° (CD45.1+) Ptprc bone marrow cells were transplanted into CD45.1
recipient mice. Complete donor cell engraftment by wild-type CD45.1+ cells was
confirmed by flow cytometric analysis.
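
The limiting dilution analysis described above rests on single-hit Poisson statistics: if CRUs occur at frequency f per transplanted cell, the expected fraction of recipients that fail to engraft at dose d is exp(−f·d), so 37% negative recipients corresponds to a dose that contains one CRU on average. The Python sketch below illustrates a maximum-likelihood estimate of f under that model. It is not the L-Calc implementation; the function names and the example counts are hypothetical and are not data from this study.

```python
import math

def neg_log_likelihood(freq, doses, n_mice, n_negative):
    """Negative log-likelihood of the single-hit Poisson model.

    freq        -- assumed CRU frequency per transplanted cell
    doses       -- cells injected per mouse in each dose group
    n_mice      -- recipients per dose group
    n_negative  -- recipients per group that failed to engraft
    """
    nll = 0.0
    for d, n, k in zip(doses, n_mice, n_negative):
        log_p_neg = -freq * d              # log P(graft contains no CRU)
        p_pos = -math.expm1(-freq * d)     # P(graft contains >= 1 CRU)
        nll -= k * log_p_neg
        if n - k > 0:
            nll -= (n - k) * math.log(p_pos)
    return nll

def estimate_cru_frequency(doses, n_mice, n_negative, lo=1e-9, hi=1e-2):
    """Ternary search for the frequency that minimises the convex NLL."""
    for _ in range(200):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        nll1 = neg_log_likelihood(m1, doses, n_mice, n_negative)
        nll2 = neg_log_likelihood(m2, doses, n_mice, n_negative)
        if nll1 < nll2:
            hi = m2
        else:
            lo = m1
    return (lo + hi) / 2.0

# Hypothetical counts (not from the paper): three dose groups of ten mice.
doses = [25_000, 100_000, 200_000]
n_mice = [10, 10, 10]
n_negative = [8, 4, 1]
freq = estimate_cru_frequency(doses, n_mice, n_negative)
print(f"Estimated frequency: 1 CRU per {1 / freq:,.0f} cells")
```

Comparing the estimate for mutant and control donors gives the fold change in functional HSC content; dedicated software additionally reports confidence intervals, which this sketch omits.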

Lentivirus infection. Mice were treated with 150 µg per g (body weight) of 5FU to
activate and enrich for HSPCs. Four days later, bone marrow was collected and
cultured overnight in ST media and transduced by Magnetofection™ using
ViroMag R/L particles according to the manufacturer's protocol (OZ Biosciences).
Transplantation experiments conducted in the knockdown model were
done with 300,000 unsorted infected bone marrow cells (CD45.2). The cells were
transplanted into each lethally irradiated (10 Gy) Ptprc (CD45.1) recipient. Eight
weeks after engraftment, bone marrow cells that were double positive for GFP and
CD45.2 were sorted for CD34− LSK and CD34+ LSK.


Lentivirus construction. The pSicoR-EF1α promoter-IRES-EGFP lentiviral construct
was provided by T. Xie.

mir-675: forward, AGCGTGCGGCCCAGGGACTGGTGCGGAAAGGGCCCA 
CAGTGGACTTGGTACACTGTATGCCCTAACCGCTCAGTCCCTGGGTC 
TGGCA; reverse, GGCATGCCAGACCCAGGGACTGAGCGGTTAGGGCAT
ACAGTGTACCAAGTCCACTGTGGGCCCTTTCCGCACCAGTCCCTGGG 
CCGCA. 

IGF2 shRNA: forward, AGCGCGCCCAAATTTGATTGGCTCTAAATAGTGA 
AGCCACAGATGTATTTAGAGCCAATCAAATTTGGTCA; reverse, GGCAT 
GACCAAATTTGATTGGCTCTAAATACATCTGTGGCTTCACTATTTAGA 
GCCAATCAAATTTGGGCGG. 

Allele-specific Igf2 expression. Male Cast mice were crossed with female flDMR
or ΔDMR. Progeny are heterozygous at single nucleotide polymorphisms that differ
between the two strains. In mice that inherited the Cast allele paternally and
flDMR maternally, LT-HSCs, ST-HSCs and MPPs were sorted from total bone
marrow cells. RNA was extracted and DNase treated using RQ1 RNase-Free
DNase according to the manufacturer’s instructions (Promega). This RNA was
reverse transcribed in the presence of SuperScriptIII Reverse Transcriptase (SSIII)
using Igf2-specific primer Igf2-20r (5'-gggttgtttagagccaatcaa-3') according to the
manufacturer’s instructions (Invitrogen); simultaneously, equal concentrations of
RNA were identically treated in the absence of SSIII (for minus (−) RT). Equal
volumes of RT and −RT were amplified using 0.5 µM of primers Igf2-18f
(5'-atctgtgacctcctcttgagcagg-3') and Igf2-20r and Go-Taq Green Master Mix
(Promega) using the following PCR conditions: 94 °C for 2 min for one cycle;
94 °C for 15 s, 58 °C for 15 s, and 72 °C for 20 s for 43 cycles. No product was
detected in −RT samples. Amplified Igf2 was digested with MluCI (NEB) and the
paternal Cast product (165 base pairs (bp)) and the maternal B6 product (180 bp)
were resolved on a 15% polyacrylamide gel similar to methods described earlier.
Microarray. RNA was extracted by conventional TRIzol® method from sorted
LT-HSCs, ST-HSCs and MPPs (10,000 cells each). Samples were analysed with
Affymetrix MouseGenome430_2 arrays and scanned with a GeneChip Scanner
3000 7G using GeneChip Fluidics Station 450 and GeneChip Operating Software
(GCOS 1.4). Heat map data represent the fold change between two populations
from at least three independent biological samples. Three samples were run on
Affymetrix Mouse 430.2 arrays in triplicate, using the standard Affymetrix
protocols. CEL files were read into the R software environment
(http://www.cran.r-project.org/) and normalized with RMA. Normalized data were
fitted with a linear model using the limma package and three contrasts were
fitted: CD34p/CD34n, FLK2p/CD34p, FLK2p/CD34n. Probes that were significant for
at least one contrast (BH-adjusted P < 0.05) went to further analysis.
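
The probe filter described above (significant in at least one contrast at a BH-adjusted P below 0.05) can be sketched as follows. The analysis in the text was performed with the limma package in R; this pure-Python version only illustrates the Benjamini-Hochberg step-up adjustment and the "any contrast" rule. The function names and P values are hypothetical, and adjusting each contrast separately is an assumption.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted

def significant_in_any_contrast(p_by_contrast, alpha=0.05):
    """p_by_contrast maps a contrast name to raw P values, one per probe.

    Each contrast is adjusted separately (an assumption), and a probe is
    kept if its adjusted P value is below alpha in at least one contrast.
    """
    adjusted = {name: benjamini_hochberg(p) for name, p in p_by_contrast.items()}
    n_probes = len(next(iter(adjusted.values())))
    return [any(adjusted[name][i] < alpha for name in adjusted)
            for i in range(n_probes)]

# Hypothetical raw P values for four probes in the three contrasts.
raw = {
    "CD34p/CD34n": [0.001, 0.20, 0.04, 0.80],
    "FLK2p/CD34p": [0.300, 0.01, 0.60, 0.90],
    "FLK2p/CD34n": [0.002, 0.50, 0.70, 0.95],
}
keep = significant_in_any_contrast(raw)
print(keep)
```

Note that a raw P value of 0.04 does not survive adjustment in the first contrast, which is the practical effect of controlling the false discovery rate across probes.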

A list of imprinted genes was taken from the catalogue of imprinting genes at 
http://igc.otago.ac.nz/1101Summary-table.pdf. Names were matched to MGI and 
Ensembl 63 genes and then converted to probeset identifications. Of 125 input 
genes, 86 could be mapped to probesets, and, of these, 23 were significant. Sample 
expression coefficients per probeset were averaged together by gene. Expression 
amounts varied widely, from 4 to 14 in log2 scale, which obscured the regulatory
trend across samples during clustering. We constructed a heat map to show only 
the trending of expression, not the magnitude, by subtracting the mean from each 
row and dividing by the standard deviation. Thus, the heat map scale shows 
expression z-scores. Row ordering reflects hierarchical clustering, average linkage, 
using Pearson dissimilarity for distance. Microarray data have been deposited in 
ArrayExpress under accession number E-MTAB-1644.
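
The row normalization behind the heat map can be illustrated with a short Python sketch: each gene's values are centred on the row mean and divided by the row standard deviation, so genes with very different absolute expression show the same trend on a common z-score scale. The function names and the expression values are hypothetical; using the sample standard deviation (n − 1) is an assumption, and the average-linkage clustering step itself is not implemented here.

```python
import math
import statistics

def row_zscores(row):
    """Centre a row on its mean and scale by its sample standard deviation.

    A row with zero variance would raise ZeroDivisionError and needs to be
    filtered out beforehand.
    """
    mean = statistics.mean(row)
    sd = statistics.stdev(row)  # sample s.d. (n - 1); an assumption
    return [(x - mean) / sd for x in row]

def pearson_dissimilarity(a, b):
    """1 - Pearson correlation, usable as a clustering distance."""
    ma, mb = statistics.mean(a), statistics.mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return 1.0 - cov / math.sqrt(var_a * var_b)

# Hypothetical log2 expression of two genes across three populations.
low_gene = [4.0, 5.0, 6.0]
high_gene = [12.0, 13.0, 14.0]
print(row_zscores(low_gene), row_zscores(high_gene))
print(pearson_dissimilarity(low_gene, high_gene))
```

Both hypothetical genes rise by the same amount across populations, so their z-scored rows coincide and their Pearson dissimilarity is zero even though their magnitudes differ by eight log2 units.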
qRT-PCR. Total RNA (2-50 ng) was extracted from sorted LT-HSCs, ST-HSCs and 
MPPs directly into TRIzol®. This was followed by DNase I treatment (Ambion) and 
reverse transcription with a high-capacity complementary DNA (cDNA) reverse 
transcription kit (Applied Biosystems). cDNA was pre-amplified by TaqMan 
PreAmplification master mix (Applied Biosystems) according to the manufac- 
turer’s instructions. TaqMan gene expression assays (Applied Biosystems) were 
performed on triplicate samples with a 7500 Real-Time cycler (Applied 
Biosystems). Data were normalized relative to Gapdh and Hprt1. For the miR-675-3p
assay, extracted RNA was reverse transcribed using a TaqMan miRNA
reverse transcription kit (Applied Biosystems). TaqMan pre-amplification and
TaqMan gene expression assay were performed according to the manufacturer’s 
instructions. All qRT-PCR was performed using TaqMan probes. 
Immunostaining. Immunostaining was performed as described previously. For
immunostaining of sorted cells, cells were sorted onto lysine-coated slides, fixed
with chilled methanol for 10 min, followed by blocking and staining with primary
antibody. The following primary antibodies were used: chicken anti-Igf1r
(Abcam, 1:100), rabbit anti-Igf2 (Abcam, 1:100), rabbit anti-FoxO3a (1:100), rabbit
anti-FoxO1/3/4-Pan and phospho-Thr24/32 (Assay Biotech, 1:50).

RNA-seq. The RNA-sequencing library was prepared from approximately 200 ng 
of total RNA (mH1942™®* Tofir~/~, mH194P!®" Toft’ and mH19"O*'*) 
for each sample using an Illumina TruSeq RNA Sample Prep Kit (catalogue number 
FC-122-1001). The fragment size in the generated library ranged from 220 to 500 bp
with a peak at 280 bp. A total of 10 fmol library fragments were loaded to cBot to
generate clusters, followed by sequencing on an Illumina HiSeq 2000 to produce
10 million to 30 million paired-end 100 bp reads per sample. Reads were trimmed to
70 bp owing to quality and aligned to mm9 with Tophat 1.3.1 (ref. 42)/Bowtie 0.12.7
(ref. 43), using the Ensembl 63 GTF file for gene models. Parameters were -g 1,
mate-inner-dist 200, mate-std-dev 70, segment-length 35, segment-mismatches 2;
this allowed for four mismatches per read (two per read half) and unique alignments
only.

Gene expression was quantified using Cufflinks 1.0.3 (ref. 44). We chose any 
imprinted genes with an absolute log-fold change of 1.3. A total of 38 imprinted 
genes were selected this way, 32 having measurements on both samples and six 
having measurements in only one sample. We heat mapped the genes found in 
both samples using FPKMs only. For the genes found in both, the range of 
expression was skewed enough to make visualization by heat map difficult, so 
we created a row-normalized heat map as with the microarray data. RNA-seq data 
have been deposited in ArrayExpress under accession number E-MTAB-1628. 
PAAR-repeat proteins sharpen and diversify the type VI secretion system spike


Mikhail M. Shneider, Sergey A. Buth, Brian T. Ho, Marek Basler, John J. Mekalanos & Petr G. Leiman


The bacterial type VI secretion system (T6SS) is a large multicomponent,
dynamic macromolecular machine that has an important
role in the ecology of many Gram-negative bacteria. T6SS is responsible
for translocation of a wide range of toxic effector molecules,
allowing predatory cells to kill both prokaryotic and eukaryotic
prey cells. The T6SS organelle is functionally analogous to contractile
tails of bacteriophages and is thought to attack cells by initially
penetrating them with a trimeric protein complex called the
VgrG spike. Neither the exact protein composition of the T6SS organelle
nor the mechanisms of effector selection and delivery are known.
Here we report that proteins from the PAAR (proline-alanine-alanine-arginine)
repeat superfamily form a sharp conical extension on the
VgrG spike, which is further involved in attaching effector domains
to the spike. The crystal structures of two PAAR-repeat proteins
bound to VgrG-like partners show that these proteins sharpen the
tip of the T6SS spike complex. We demonstrate that PAAR proteins
are essential for T6SS-mediated secretion and target cell killing by
Vibrio cholerae and Acinetobacter baylyi. Our results indicate a new
model of the T6SS organelle in which the VgrG-PAAR spike complex
is decorated with multiple effectors that are delivered simultaneously
into target cells in a single contraction-driven translocation event.

The T6SS organelle is a 1,000-nm-long tubular structure consisting
of an inner tube made of multiple copies of the Hcp protein and an
external contractile sheath composed of VipA and VipB proteins (also
known as TssB/TssC). It has been visualized in two conformations
(extended and contracted), which are both attached to the cell envelope
by means of a baseplate complex. Rapid contraction of the sheath
results in translocation of the inner tube out of the predator cell and
into the prey cell. The trimeric VgrG spike protein is positioned at the
end of the tube. It is believed to penetrate the prey cell with its
needle-shaped carboxy-terminal β-helical domain. VgrG proteins can contain
additional C-terminal domains that act as effectors and may also bind
effectors through undefined mechanisms.

Because VgrG proteins are orthologous to the central baseplate spikes
of bacteriophages with contractile tails, we reasoned that additional
structural components present in certain phage spikes might have
corresponding orthologous components in T6SS. The cryoEM reconstruction of
phage T4 baseplate shows that an unknown protein with a molecular
weight between 7 and 23 kilodaltons (kDa) binds to the tip of the β-helical
domain of the central spike protein gp5 (ref. 9). We analysed all known
genes encoding small proteins in phage genomes with gp5-like spikes and
compared them to T6SS genes. Proteins containing the PAAR-repeat
motif were strongly represented in this group, with gp5.4 being the
corresponding protein of T4 phage. Furthermore, genes encoding proteins
with PAAR motifs were frequently found immediately downstream from
vgrG-like genes, suggesting that the two are genetically linked. Therefore,
we devised a strategy to test the hypothesis that these PAAR proteins were
binding to the tip of gp5 and VgrG proteins.

Careful examination of VgrG sequences showed that a β-structural
repeat, which is presumed to be responsible for β-helix formation,
either extends to the very C terminus of the protein or terminates with
a glycine/serine-rich stretch (Extended Data Fig. 1). We surmised that
the glycine/serine-rich stretch bends the polypeptide chain away from
the β-helix without disturbing its tip, and that all VgrG β-helices have
blunt ends resembling that of T4 gp5 β-helix. The X-ray crystal structure
of one full-length VgrG trimer supports this conclusion (Sycheva
L. V., Shneider M. M., Basler M., Ho B., Mekalanos J. J. & Leiman P. G.,
manuscript in preparation). Thus, we further speculated that the binding
site for a PAAR protein was the blunt end of the β-helix and
designed experiments to test this idea.

Because of the solubility problems of most tested VgrG and PAAR
proteins, we used a soluble fragment of T4 gp5 β-helix (residues 484-575),
which is known to fold into a stable native-like trimeric structure,
as a platform for creating β-helices mimicking the blunt end of various
VgrG spikes. We replaced the tip of gp5 β-helix (the last two β-strands)
with an equivalent fragment from several putative β-helices of VgrGs
from Escherichia coli, Vibrio cholerae and Pseudomonas aeruginosa
(Extended Data Fig. 1). These gp5-VgrG chimaeras (hereafter referred
to as gp5(VgrG)) were then co-expressed with various PAAR proteins,
in cis and in trans, in different combinations (Extended Data Fig. 1).
Surprisingly, the V. cholerae VCA0105 and E. coli c1882 PAAR proteins
were found to bind not only to the ends of their endogenous VgrG
proteins but also to the non-mutated wild-type end of the gp5 β-helix.

The gp5(c1883)-c1882 and gp5(VCA0018)-VCA0105 complexes
were purified to homogeneity and crystallized (Extended Data Table 1).
The structures were solved by molecular replacement using the
corresponding gp5 β-helix fragment as a search model. In both complexes,
a single chain of the PAAR protein folds into a symmetrical
cone-shaped structure with a sharp tip and a triangular base fully
occupying the blunt end of the β-helix (Fig. 1). The cone contains nine
short β-strands, three of which create its base and participate in binding
to the VgrG β-helix and six others form three β-hairpins that point
towards the vertex of the cone, but have different lengths. The PAAR
proteins interact with the VgrG β-helix by means of a virtually
flat hydrophobic patch and 14 or 16 hydrogen bonds for c1882 or
VCA0105, respectively (Extended Data Fig. 2). In both proteins, 12
hydrogen bonds between the main chain atoms of the VgrG β-helix
and those of the PAAR domain form a perfect triangle surrounding the
central hydrophobic patch creating a unique binding platform (Extended
Data Fig. 3). PISA software shows that ~16.5% of the PAAR protein
surface is buried in this interface and the free energy of interaction between
the VgrG tip and c1882 or VCA0105 PAAR proteins is −5.5 kcal mol−1
or −3.4 kcal mol−1, respectively.

The fold of the PAAR protein is stabilized by a zinc atom positioned 
close to the cone’s vertex (Fig. 1d, Extended Data Fig. 4). The Zn 
binding site consists of three histidines and one cysteine—H14, H46, 
H54 and C81 in VCA0105—that are very well conserved in close 
homologues (Extended Data Fig. 5a). These residues are replaced with 
similar or complementary metal-binding residues (arginines, lysines 
and glutamines) in more distant homologues suggesting that they also
carry a metal ion roughly at the same position. The metal ion, being a 
natural ligand for this site, stabilizes the pointed tip of the PAAR 
domain and is likely to be important for its integrity during penetra- 
tion of the target cell envelope. 

The PAAR motif sequence is also conserved, albeit to a lower degree 
than the Zn binding site (Extended Data Fig. 5a). The PAAR motifs 
function to stabilize the fold by forming the central part of the struc- 
ture where the three regions of the polypeptide chain containing the 
Figure 1 | Crystal structure of the VCA0105 PAAR-repeat protein bound to
its VgrG-like partner. a, Schematic representation of the conserved domains
comprising the VgrG-PAAR complex. The last strands of the β-helix that form
the PAAR binding site are in light blue. The grey arrow shows the fragment 
roughly corresponding to the crystal structure. b, Molecular surface 
representation of the gp5(VCA0018)-VCA0105 complex crystal structure. 
Each protein chain is labelled with its own colour. c, Ribbon diagram of the 
gp5(VCA0018)-VCA0105 complex. d, The polypeptide chain of the VCA0105 
PAAR protein is coloured in rainbow colours with the N terminus in blue and C 
terminus in red. Residues responsible for Zn binding are labelled. PAAR motifs 
are highlighted with thick purple coils. The sequence and residue numbers of 
the starting and ending position for each of the three PAAR motifs are boxed. 


PAAR motif meet and intertwine (Fig. 1d, Extended Data Fig. 5b, c).
Hydrophobic interactions and buried main chain hydrogen bonds
mediate the interaction of the three PAAR motifs (Extended Data
Fig. 5b, c). The distance between the Cα atoms of the third residue
of the three PAAR motifs in both proteins is only 5.3 ± 0.1 Å (Extended
Data Fig. 5b, c). c1882 and VCA0105 show 61.3% sequence identity and
as a consequence the two structures are very similar with a root mean
square deviation (r.m.s.d.) of 0.53 Å between all the 94 Cα atoms
comprising the backbone (Extended Data Fig. 6).
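
For readers unfamiliar with the r.m.s.d. figure quoted above, it is the square root of the mean squared distance between equivalent atoms, here the Cα atoms of the two PAAR proteins. The Python sketch below shows only that final calculation. It assumes the two coordinate sets have already been optimally superposed (the superposition step is not implemented), and the function name and coordinates are hypothetical, not taken from the deposited structures.

```python
import math

def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally long lists of
    (x, y, z) coordinates that are assumed to be already superposed."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))

# Hypothetical coordinates in angstroms, for illustration only.
model_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
model_b = [(0.0, 0.5, 0.0), (3.8, 0.5, 0.0), (7.6, 0.5, 0.0)]
print(rmsd(model_a, model_b))
```

In this toy case every atom is displaced by the same 0.5 Å, so the r.m.s.d. equals that displacement; in real structures the value summarizes a distribution of per-atom deviations.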

Given such a critical location of PAAR proteins within the T6SS 
organelle and their high structural conservation, we speculated that 
inactivation of PAAR genes would probably interfere with T6SS functions, 
including protein secretion and prey cell killing. Because some 
T6SS⁺ organisms have many gene products predicted to carry the 
PAAR motif, we focused on Vibrio cholerae strain 2740-80 and 
Acinetobacter baylyi ADP1, which encode only two and three 
PAAR proteins, respectively. In A. baylyi, inactivation of all three 
PAAR genes (aciad0051, aciad0052, aciad2681) led to an ~90% reduction 
in Hcp secretion (Extended Data Fig. 7) and at least a 10,000-fold 
decrease in T6SS-dependent killing of E. coli (Fig. 2a, c). Similarly, 
inactivation of both genes (vca0105 and vca0284) that encode PAAR 
proteins in V. cholerae 2740-80 resulted in a ~100-fold decrease in 
T6SS-dependent killing of E. coli and ~70% reduction in Hcp secretion (Fig. 2b, 
c; Extended Data Fig. 7). Remarkably, in both bacterial species, mutants 
inactivated in single PAAR genes showed no or only a modest defect in 
the functionality of their T6SS apparatus (Fig. 2) indicating that PAAR 
proteins within a species are interchangeable. We further attempted to 
complement these double and triple PAAR mutants with heterologous 
PAAR genes but these experiments were inconclusive (Extended Data 
Fig. 8). Thus, PAAR proteins may be very specific for their cognate VgrG 
proteins in the context of the functional T6SS organelle. 

The dramatic reduction in Hcp secretion in the triple PAAR gene 
knockout mutant of A. baylyi suggests that PAAR proteins participate 
Figure 2 | PAAR proteins are required for full functionality of the T6SS in 
Vibrio cholerae and Acinetobacter baylyi. a, Recovery of viable E. coli 
MG1655 after co-incubation with A. baylyi ADP1 (wild type, WT) and its T6SS 
and PAAR gene knockout mutants. The following genes were inactivated in 
the mutants shown: T6S⁻, aciad2688 to aciad2694; 2681⁻, aciad2681; 0051-52⁻, 
both aciad0051 and aciad0052; 3×P⁻, all three PAAR genes aciad0051, 
aciad0052 and aciad2681. Leaky, basal expression of the aciad2681 PAAR gene 
from plasmid pMMB67EH (ref. 26), labelled as p2681, rescues the killing defect in the 
triple PAAR mutant (right panel). b, Recovery of E. coli MG1655 colony 


forming units (c.f.u.) after co-incubation with V. cholerae 2740-80 and its T6SS 
and PAAR gene knockout mutants, which are labelled as follows: T6S⁻, vipA; 
105⁻, vca0105; 284⁻, vca0284; 2×P⁻, vca0105 and vca0284. Symbols *, ** and 
*** indicate deviations from the wild type with P values of 6 x 10 3,8x10° 
and 5 X 10°’, respectively, for a sample size of 8. Error bars represent one 
standard deviation. c, SDS-PAGE (sodium dodecyl sulphate—polyacrylamide 
gel electrophoresis assay) for T6SS-dependent secretion of Hcp proteins by the 
parental strains and T6SS and PAAR genes knockout mutants. Panels a and 
c show one out of three experiments with similar outcomes. 
Figure 3 | The VSV-G epitope-tagged PAAR protein ACIAD2681 is 
secreted by A. baylyi ADP1. T6SS-dependent secretion of VSV-G epitope-tagged 
ACIAD2681 expressed from plasmid pMMB67EH (left panels). VSV-G 
epitope-tagged ACIAD2681 fully rescues the Hcp secretion defect of the triple 
knockout PAAR mutant of A. baylyi ADP1 (right panel). The mutants are 
labelled as in Fig. 2. A representative of three identical experiments was chosen 
for each panel. 


in the assembly of the T6SS complex by either nucleating the folding of 
VgrG trimers or regulating their incorporation into the T6SS organelle. 
Similar to the T4 phage system in which mutants lacking gp5 
(VgrG orthologue) are tailless’’, any disruption in VgrG trimeric 
assembly will probably block the assembly of the T6SS organelle. 
Alternatively, PAAR proteins may be important for another function 
of the apparatus such as translocation of the VgrG spike through the 
predator outer membrane during a sheath contraction event. 

The PAAR-repeat proteins form a diverse superfamily called 
CL15808 in the Conserved Domains Database (CDD)“ that contains 
three families: PF05488, COG4104 and PF13665 (or DUF4150). The 
first two families are similar and describe PAAR domains that are on 
average ~95 residues long, whereas the PF13665 family is somewhat 
more distant and its typical members contain ~130 amino acids. The 
crystal structures reported here include representatives of the PF05488 
family. Hundreds of hypothetical proteins in the database contain PAAR 
domains that are extended both N- and C-terminally by domains with 
various predicted functions (Extended Data Fig. 9). The crystal structure 
shows that the termini of the PAAR domain are open to solution and 
thus can be extended without distorting the VgrG binding site (Fig. 1, 
Extended Data Fig. 6). 

The C-terminal domains of many hypothetical PAAR proteins are 
predicted to have various enzymatic activities that are toxic for prokar- 
yotic and eukaryotic cells (Extended Data Fig. 9). Very similar putative 
effector domains can be found fused to the C termini of VgrG proteins 
(for example, VIP2 ADP-ribosyl transferase). Binding of these larger 
PAAR proteins to the tip of VgrG spikes would decorate the T6SS spike 
with a great variety of effector domains. To test this hypothesis, we 
expressed a VSV-G epitope-tagged version of the ACIAD2681 PAAR 
protein in the wild-type A. baylyi ADP1 and in its triple PAAR gene 
knockout mutant (Fig. 3). The epitope-tagged protein was secreted by 
T6SS and fully restored T6SS-mediated killing of E. coli in the triple 
PAAR gene knockout strain (Figs 2a and 3), suggesting that other 
effector proteins fused to PAAR domains will probably also be targeted 
for secretion by binding the tip of VgrG trimers. 

VCA0284, the larger of the two V. cholerae V52 PAAR proteins, 
carries a transthyretin domain (TTR) at its C terminus, which is a very 
common architecture of PAAR proteins (Extended Data Fig. 9). TTR 
is an immunoglobulin-like domain that is known to form oligomeric 
structures in which these domains interact with each other or with 
other partners’®. Thus, PAAR-associated TTR domains may act as adap- 
ters to further decorate the VgrG tip with effectors displaying TTR 
domains or serve to bind the spike to other TTR domain-containing 
proteins such as the TssJ/SciN lipoprotein’’, an outer membrane structural 
component of the T6SS ‘baseplate’. 

Besides being in T6SS gene clusters, many PAAR genes are fre- 
quently found downstream of vgrG genes®, which in turn, are often 
Figure 4 | Multiple effector translocation VgrG (MERV) model for the 
organization of the T6SS central spike/baseplate. Effectors are predicted to 
be loaded onto the spike complex by five distinct mechanisms: 1) C-terminal 
extensions of the VgrG spike; 2) non-covalent binding to the VgrG spike; 3) N- 
or C-terminal extensions of the PAAR protein; 4) non-covalent binding to the 
PAAR protein or its extension domains; 5) incorporation into the cavity 
formed by the gp27 domain of VgrG. A single T6SS sheath contraction event 
translocates the VgrG spike with all of its cargo proteins into a nearby target 
cell. Other proteins making up the T6SS ‘baseplate’ (grey colour) are not 
labelled but presumably reside within or attached to the inner and outer 
membranes and peptidoglycan layer (IM, OM, and PG, respectively). 


encoded by Rhs elements (accessory sequences that have the hallmarks 
of horizontally acquired DNA, Extended Data Fig. 9). A recent report’” 
shows that two different nucleases encoded by the rhsA and rhsB genes 
of Dickeya dadantii are delivered into target cells in a VgrG-dependent 
process, suggesting that the T6SS locus of this organism mediates 
translocation of these effectors. Interestingly, both RhsA and RhsB 
proteins contain PAAR repeat regions and other features that categor- 
ize them as Class 7 PAAR domain architecture proteins (Extended 
Data Fig. 9b). Thus, our working model (Fig. 4) predicts that these 
T6SS nuclease effectors should bind to the tip of VgrG trimers through 
their PAAR repeat domains and in this way be targeted for secretion 
and translocation into prey cells. 

The structural, functional and bioinformatic findings summarized 
above allow us to make several conclusions and predictions. Because a 
PAAR-repeat protein caps the end of the β-helix of a VgrG spike, it is 
in fact the piercing tip that is responsible for the initial event of creating 
an opening in the target cell envelope. Furthermore, because the crystal 
structure demonstrates that the canonical PAAR-repeat domain can 
be extended N- or C-terminally without distorting its structure or its 
VgrG β-helix binding site, we predict that large PAAR proteins carrying 
effector domains will probably also bind to VgrG spikes and be 
translocated into target cells by the T6SS organelle. Considering the 
findings reported here and other published data, we propose that there 
are five mechanisms by which effectors can be incorporated into the 
T6SS spike complex (Fig. 4). Three of them: 1) C-terminal extensions 


8,18, 


of the VgrG spike®; 2) binding surface features on the VgrG protein*”’; 
and 3) N- or C-terminal extensions of the PAAR protein”; are supported 
by direct or indirect experimental evidence. Two others: 4) 
binding surface features or additional domains (for example, the 
TTR domain) on PAAR proteins and 5) incorporation into the cavity 
formed by the gp27 domain of VgrG, remain speculative. Thus, the 
T6SS machine may be capable of delivering a multifunctional ‘cargo’ or 
multiple effector translocation VgrG (MERV) spike into the target prey 
cell in a single molecular translocation event driven by T6SS sheath 
contraction. 


METHODS SUMMARY 


Structure determination of gp5(VgrG)-PAAR complexes. The gp5 fragment 
containing residues 484-575 and its variants with modified C termini mimicking 
VgrG were cloned into a derivative of the pET-23a expression vector (Novagen) 
with a TEV-cleavable N-terminal polyhistidine tag. For expression in cis, PAAR 
genes were cloned downstream from the gp5 fragment with a spacer containing 
the AGGAGG ribosome-binding site. For expression in trans, PAAR genes were 
cloned into a modified pACYCDuet-1 expression vector (Novagen), retaining its 
wild-type N and C termini. The complexes were purified by metal-affinity, ion 
exchange and size exclusion chromatographies. Conditions yielding diffraction 
quality crystals were refined from the initial hits obtained with the help of com- 
mercial crystallization screens (Jena Bioscience). The structures were solved by 
molecular replacement with the help of PHASER” using the corresponding fragment 
of the gp5 β-helix as a search model (Protein Data Bank (PDB) accession ID 
number 1K28)'°. Crystallographic refinement was performed with PHENIX”, 
REFMAC5 (ref. 21) and COOT”. 

Bioinformatics. 1,353 putative orthologues of PAAR repeat proteins were iden- 
tified with the help of HMMER”. Phage proteins and orphan genes were excluded 
from this analysis. Functional assignment was based on high confidence hits 
obtained with the help of HHpred™*, HMMER”, CDD" and CDART”. 
Bacterial killing assays. Bacterial competitions and statistical analysis involving 
V. cholerae 2740-80, A. baylyi ADP1 and E. coli MG1655 were performed as 
described previously’. The plasmid pMMB67EH was used for VSV-G-tagging 
and expression of ACIAD2681 (ref. 26). 

Protein secretion. Protein supernatants were analysed as described previously’. 
Immunoblotting followed standard methodology using anti-VSV-G primary 
antibodies (Sigma). The recombinantly expressed ACIAD2681 PAAR protein 
carried a standard VSV-G epitope tag at its C terminus. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 


Cloning and protein expression. The gp5 fragment containing residues 484-575 
was cloned into the in-house designed expression vector pEEva2 using the In-Fusion 
system (Clontech). The pEEva2 vector is a derivative of the pET-23a expression 
plasmid (Novagen) with a modified multiple cloning region. The T7-tag was 
replaced with a MGSSH₆SSG polyhistidine tag (His-tag) followed by a sequence 
containing a tobacco etch virus (TEV) protease cleavage site (ENLYFQ/GSGS, 
cleavage site denoted by ‘/’). Upon expression and after TEV cleavage, the gp5 fragment 
contained four residues (GSGS) upstream of residue 484. The C-terminal 
residues of gp5 were modified by performing PCR with long primers containing 
the required mutations (Extended Data Fig. 1). 

For expression in cis, PAAR genes were cloned downstream from the gp5 frag- 
ment. The spacer between the stop codon of gene 5 and the first codon of PAAR 
genes contained a standard ribosome binding sequence and was about 15 bases long. 

For expression in trans, PAAR genes were cloned into the pATE vector (chlor- 
amphenicol selection), which is a modified version of the pACYCDuet-1 expres- 
sion plasmid (Novagen). The dual cloning site of the pACYCDuet-1 vector was 
replaced by the multiple cloning site from the pEEva2 vector. The cloned PAAR 
proteins retained their wild type N and C termini (that is, were tag-free). 
Expression and purification of gp5(VgrG)-PAAR complexes. gp5(VgrG)- 
PAAR complexes were expressed in Escherichia coli B834(DE3) cells grown in 
2× TY medium containing ampicillin at 100 μg ml⁻¹ for the cis constructs and, 
additionally, chloramphenicol at 34 μg ml⁻¹ for the trans constructs. The cultures, 
with a total volume of 2 litres, were incubated at 37 °C with shaking at 200 r.p.m. 
until the culture reached a D600 nm of 0.6. The culture was cooled down to 18 °C, and 
protein expression was induced by the addition of isopropyl-β-D-thiogalactoside 
to a final concentration of 1 mM. The expression continued overnight. 

The cells were collected on the following day by centrifugation at 8,000g at 4 °C 
for 10 min. The cell pellet was resuspended in a lysis buffer that contained 50 mM 
TrisCl pH 8.0, 300 mM NaCl, 5 mM imidazole. The cells were lysed by ultrasoni- 
cation, which was performed on ice with the temperature of the lysate maintained 
below 10°C. The lysate was then centrifuged at 25,000g, at 4°C for 10 min. The 
supernatant was applied to a Ni column (5 ml GE HisTrap FF Crude). The non- 
specifically bound material was removed by washing the columns with 10 column 
volumes of a washing buffer (50 mM TrisCl pH 8.0, 300 mM NaCl, 20 mM imidazole). 
The affinity bound material was eluted with 10 column volumes of an 
elution buffer (20 mM TrisCl pH 8.0, 300 mM NaCl, 250 mM imidazole). 

The fractions containing the target protein were pooled and the TEV 
protease was added to a concentration of 10% (w/w) of the target protein. The 
mixture was then placed into a dialysis bag, and the proteolysis continued overnight 
with simultaneous dialysis against a buffer optimal for TEV cleavage 
(10 mM TrisCl pH 8.0, 3 mM DTT (dithiothreitol), 1.5 mM EDTA). The digested 
protein was further purified by ion-exchange chromatography performed with a 
GE Mono Q 10/100 GL column connected to an ÄKTApurifier 100 system (GE 
Healthcare Life Sciences). The sample was loaded onto the column, which was 
equilibrated with buffer A (20 mM TrisCl pH 8.0), and eluted with a linear gradient 
against buffer B (20 mM TrisCl pH 8.0, 1 M NaCl). The gradient extended from 0% 
to 65% of the buffer B concentration. Relevant fractions were combined and 
concentrated using Sartorius ultrafiltration devices with a molecular weight cutoff of 
10,000 to a volume of ~5 ml. This sample was then loaded onto a GE HiLoad 16/60 
Superdex 200 size-exclusion column pre-equilibrated with 10 mM TrisCl 
pH 8.0, 150 mM NaCl. The fractions containing pure gp5(VgrG)-PAAR complexes 
were combined and concentrated to 25 mg ml⁻¹ with the help of a similar 
Sartorius ultrafiltration device and without changing the buffer. The protein was 
stored in the same buffer at 4 °C until it was used for crystallization. All purification 
buffers and the final protein solution contained NaN₃ at a concentration of 
0.02% (w/v). 

Identification of gp5(VgrG)-PAAR complexes. The purification procedure 
employed metal affinity chromatography based on the N-terminal His-tags of 
gp5 fragments with slightly modified C termini (Extended Data Fig. 1), and thus 
led to accumulation of two protein species, gp5(VgrG)-PAAR complexes and gp5 
fragments alone. The β-helical fragment of gp5 used in this study does not fully 
denature in SDS-PAGE, and runs as several species with molecular mass of 10-30 kDa. 
Most of the PAAR proteins (molecular mass ~9.7 kDa) had the same 
SDS-PAGE mobility as the fastest migrating band of gp5 fragments (molecular 
mass ~10 kDa). However, these species were easily separable by a combination of 
high resolution anion exchange and analytical size exclusion chromatographies 
(monoQ and Superdex 75 resins, respectively). The gp5 fragment is a highly 
negatively charged β-helix’® that binds to the monoQ anion exchange resin 
strongly. Consequently, gp5 fragments that were free from PAAR proteins eluted 
from the monoQ resin in a buffer with a specific conductivity of 24-29 mS cm⁻¹. 
All gp5(VgrG)-PAAR complexes discussed in this paper were found to be much 
weaker anionic binders and eluted from the monoQ resin in a buffer with a 
specific conductivity of 18-25 mS cm⁻¹. Furthermore, size exclusion profiles 
(Superdex 75 10/300 GL) of all gp5(VgrG)-PAAR complexes had only one peak 
centred at 10.34 ± 0.06 ml, whereas gp5 fragments free from PAAR proteins 
formed various oligomers and eluted at different positions of the chromatogram. 
The VCA0284 PAAR protein containing the transthyretin domain was prone to 
aggregation and the gp5(VCA0018)-VCA0284 complex was unsuitable for size- 
exclusion chromatography. Expression of VCA0284 in the presence of the 
gp5(VCA0018)-VCA0018 or gp5(VCA0018)-VCA0123 chimaeras led to an 
almost complete loss of gp5 from solution, indicating complex formation. 

Crystallization and structure determination of gp5(VgrG)-PAAR complexes. 
The initial crystallization screening was carried out by the sitting drop method in 
96-well SWISSCI 2-lens MRC plates using Jena Bioscience crystallization screens. 
Bio-Tek Precision XS and TTP LabTech mosquito pipetting robots were employed 
for preparing crystallization plates and setting up drops each containing 200 nl of 
the protein and the same volume of the well solution. Optimization of crystalliza- 
tion conditions was performed in 24-well SuperClear plates and thick siliconized 
cover slides (both from Jena Bioscience) by hanging drop vapour diffusion. 
Crystallization drops of the 24-well plate setup contained 1.25 μl of the protein 
solution in 10 mM TrisCl pH 8.0, 150 mM NaCl mixed with an equal volume of 
the well solution. The best crystals of the gp5(c1883)-c1882 complex were obtained 
with the protein at an initial concentration of 13.5 mg ml⁻¹ and equilibrated 
against 500 μl of the well solution containing 100-150 mM CaCl₂, 13-15% PEG 
3350, 100 mM MES pH 6.5. The best crystals of the gp5(VCA0018)-VCA0105 complex 
were obtained with the protein at 15 mg ml⁻¹ and equilibrated against 500 μl 
of the well solution containing 13-14% PEG 2000, 100 mM NaAc pH 5.0. 

For data collection, the crystals were dipped for 20-45 into cryo solutions 
containing either 30% of glycerol for gp5(c1883)-c1882 or 25% of 2-methyl-2,4- 
pentanediol for gp5(VCA0018)-VCA0105 in addition to the well solution com- 
ponents and flash frozen in a vaporized nitrogen stream at 100 K. Collection of 
diffraction data and fluorescent scans was carried out at the PXI and PXIII beam 
lines of the Swiss Light Source (SLS) at the Paul Scherrer Institute (Villigen, 
Switzerland), respectively, using X-rays with a wavelength of 1.000 Å for both 
crystals. The best gp5(c1883)-c1882 and gp5(VCA0018)-VCA0105 crystals diffracted 
to 3.4 Å and 1.9 Å resolution limits, respectively. The diffraction data were indexed, 
integrated and scaled with XDS” (Extended Data Table 1). The structure of the 
gp5(c1883)-c1882 complex was solved by molecular replacement with PHASER” 
using a fragment of the gp5-gp27 complex crystal structure containing residues 
484-575 of gp5 (PDB ID 1K28) as a search model’®. As the asymmetric unit 
contained four gp5(c1883)-c1882 complexes, non-crystallographic symmetry 
(NCS) averaging was used to improve the electron density before model building. 
The model was built manually with COOT”* and refined with REFMAC5 (ref. 21) 
and PHENIX” using NCS for torsion-angle restraints. The structure of 
gp5(c1883)-c1882 was subsequently used as a search model to solve the structure 
of the gp5(VCA0018)-VCA0105 complex by molecular replacement with 
PHASER”. There was only one complex per asymmetric unit. The structure 
was refined with PHENIX”’ and COOT”™”*. 

Both refined structures, gp5(VCA0018)-VCA0105 and gp5(c1883)-c1882, had 
excellent stereochemical parameters (see also Extended Data Table 1). All main 
chain dihedral angles of the gp5(VCA0018)-VCA0105 structure were found in the 
most favoured regions of the Ramachandran plot. The gp5(c1883)-c1882 struc- 
ture had 97.3% of its main chain dihedral angles in the most favoured regions of 
the Ramachandran plot, 2.7% in the additionally allowed regions, and none in the 
disallowed region. 

Bioinformatics. Putative orthologues of PAAR repeat proteins were identified 
with the help of HMMER”. A search for high confidence hits to Pfam” domains 
PF05488 or PF13665 that describe the two distinct PAAR repeat domains resulted 
in 1,353 PAAR domain-containing proteins. Phage proteins and orphan genes 
were excluded from this analysis. Functional assignment of extension domains 
was based on high confidence hits obtained with the help of HHpred**, HMMER”, 
CDD" and CDART” web services. 

Bacterial killing assays. Overnight cultures of V. cholerae 2740-80, A. baylyi 
ADP1 and E. coli MG1655 were washed in LB, diluted 50- to 200-fold into fresh 
LB and cultivated for 2.5-3.5 h to reach a D600 nm of ~0.5-1.0. Cells were pelleted by 
centrifugation at 4,000g for 5 min, mixed at D600 nm ~ 10 in a 1:1 ratio, and 5 μl of the 
mixture was spotted on a dry LB agar plate. After 2 h at 37 °C, bacterial spots were 
cut out and the cells were resuspended in 1 ml LB. The cellular suspension was 
serially diluted in LB, and 5 μl of the suspensions was spotted on selective plates 
(streptomycin for V. cholerae and A. baylyi, and gentamicin for E. coli). aciad2681 
was expressed basally from the plasmid pMMB67EH (ref. 26). PAAR protein 
complementation in V. cholerae was done by cloning the indicated PAAR genes into 
pBAD24 (ref. 30) and inducing with the appropriate concentrations of L-arabinose. 
A two-tailed Student’s t-test was used to assess the statistical significance of 
differences in the number of colony forming units. 
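
The arithmetic behind such colony counts can be sketched as follows. This is an illustrative Python example only: the function names, the dilution factors and the colony counts are hypothetical, and it assumes that colonies are counted in a single 5 μl spot of one dilution of the 1 ml resuspension, so that c.f.u. per ml corresponds to the total recovered from one competition spot.

```python
import math

def cfu_per_ml(colonies, dilution_factor, spotted_volume_ul=5.0):
    """Back-calculate c.f.u. per ml of the undiluted suspension from one counted spot."""
    if colonies < 0 or dilution_factor < 1 or spotted_volume_ul <= 0:
        raise ValueError("invalid colony count, dilution factor or volume")
    spotted_volume_ml = spotted_volume_ul / 1000.0
    return colonies * dilution_factor / spotted_volume_ml

def log10_difference(cfu_reference, cfu_sample):
    """log10(sample / reference); positive values mean more prey cells survived."""
    return math.log10(cfu_sample / cfu_reference)

# Hypothetical counts of surviving E. coli:
# 12 colonies at a 10^2 dilution after competition with a T6SS-proficient strain,
# 30 colonies at a 10^5 dilution after competition with a T6SS-deficient mutant.
prey_with_wt = cfu_per_ml(12, 1e2)
prey_with_mutant = cfu_per_ml(30, 1e5)
print(f"{prey_with_wt:.2e} versus {prey_with_mutant:.2e} c.f.u. per ml")
print(f"log10 difference: {log10_difference(prey_with_wt, prey_with_mutant):.2f}")
```

The significance test itself (two-tailed Student's t-test) is not reproduced here, because it requires a t-distribution routine from a statistics package.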
Protein secretion. Overnight cultures of V. cholerae 2740-80 and A. baylyi ADP1 
were washed in LB, diluted 200-fold into fresh LB and cultivated for 5 h at 37 °C. 
One ml of culture supernatant was passed through a 0.2-μm filter, precipitated 
with TCA, subjected to 12% SDS-PAGE and stained with Coomassie blue. The 
recombinantly expressed ACIAD2681 PAAR protein carried a C-terminal VSV-G 
epitope tag (YTDIEMNRLGK). Western blots were performed on the TCA pre- 
cipitated samples following standard methodology using anti-VSV-G primary 
antibodies (Sigma). Band intensities were quantified using Fiji software’. 
Molecular graphics. UCSF Chimera* was used to prepare Fig. 1 and Extended 
Data Figs 2-6. 
a (sequence alignment; partly illegible in this copy) 
Known: last β-strand 
T4 gp5           569-[illegible]-575 
VC1416 VgrG1     617-GA SGGSAGSGSG...A-1163 
VCA0018 VgrG2    617-GS SGGSAGSGSG...N-694 
VCA0123 VgrG3    617-GP AGGSAGSGSA...K-1017 
EC_c1883         610-GA GGGSPGDVIL...K-652 
EC_c1888         664-GP [illegible]-801 
Prediction: last β-strand [illegible] turn 

b 
C-terminal residues   VgrG source         VgrG residues   PAAR protein   Expression 
...HIVGSAINLN         VCA0018 (VgrG-2)    614...623       VCA0284        trans 
...HLVGPAINLN         VCA0123 (VgrG-3)    614...623       VCA0284        trans 
...TIDGSRIDIG         gp5 wild type                       gp5.4          cis 
...TIDGSRIDIG         gp5 wild type                       c1882*         cis 
...TIDGSRIDIG         gp5 wild type                       VCA0105        trans 
...KIKSGNVEIG         c3393               726...735       3396           cis 
...TLKGGNIELG         PA1511              732...741       PA1508         cis 

Extended Data Figure 1 | Design of gp5-VgrG chimaeras and analysis of 
their interaction with selected PAAR proteins. a, Identification of the blunt 
ends of selected VgrG β-helices. The last β-strand of the known T4 gp5 
structure is shown in blue. Putative β-strands terminating VgrG β-helices are in 
plum colour. The glycine/serine-rich motif is highlighted in bold. The residue 
number for the first amino acid of the shown fragment and that for the 
C-terminal amino acid of the protein are given. Abbreviations used: EC, E. coli 
CFT073; VC, V. cholerae O1 biovar El Tor str. N16961. b, Binding of several 
PAAR proteins to gp5-based VgrG-like β-helices. Two complexes for which the 
crystal structures are reported in this paper are highlighted with green 
background. Entries showing gp5 modifications that did not result in PAAR 
binding have a grey background. c1882* contains three mutations, T28K, T64K 
and T90K, that were made to mimic T4 gp5.4. Note that VCA0105 and 
VCA0284 contain threonines and serines in these positions and they bind 
wild-type gp5. 
Extended Data Figure 2 | Surface features of gp5(VgrG)-PAAR complexes 
and VgrG-PAAR interface. a, b, Molecular surfaces are coloured according to 
their hydrophobicity with sky blue, white, and orange corresponding to the 
most hydrophilic, neutral and hydrophobic patches, respectively. Residue 
hydrophobicity values are according to the Kyte-Doolittle scale*, which is 
given as a coloured bar labelled ‘KD hydropathy’. Three orientations of the 
gp5(VgrG)-PAAR complex and an ‘open book’ view of the VgrG-PAAR 
interface are shown for both PAAR proteins. 
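
As a small illustration of the Kyte-Doolittle scale, the Python sketch below looks up per-residue hydropathy values and averages them over a short peptide. It is not the procedure used by the molecular graphics software to colour surfaces; the function names are hypothetical, and the example peptide is the C-terminal gp5 sequence listed in Extended Data Fig. 1b.

```python
# Kyte-Doolittle hydropathy values (one-letter amino acid codes).
KD_HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def residue_hydropathy(sequence):
    """Return the Kyte-Doolittle value of every residue in a one-letter sequence."""
    return [KD_HYDROPATHY[aa] for aa in sequence.upper()]

def mean_hydropathy(sequence):
    """Average Kyte-Doolittle hydropathy of a peptide (positive = hydrophobic)."""
    values = residue_hydropathy(sequence)
    if not values:
        raise ValueError("sequence must not be empty")
    return sum(values) / len(values)

peptide = "TIDGSRIDIG"  # C-terminal residues of wild-type gp5 (Extended Data Fig. 1b)
for aa, value in zip(peptide, residue_hydropathy(peptide)):
    print(aa, value)
print(f"mean hydropathy: {mean_hydropathy(peptide):.2f}")
```

The alternation of strongly hydrophobic isoleucines with polar residues in this peptide is the kind of pattern that a hydropathy-coloured surface makes visible.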
Extended Data Figure 3 | Main chain hydrogen-bonding network of the 
VgrG-PAAR interface. The dashed line rectangle in the left panel indicates the area 
shown enlarged in the panels on the right. The three right panels show the main 
chain hydrogen bonds between VgrG and PAAR proteins for three different 
sides of the gp5(VgrG)-PAAR complex. Residues with side chains pointing 
inwards and forming the VgrG-PAAR hydrophobic interface are in bold italic. 
Side chains are not shown for clarity. The carbon atoms of the PAAR protein 
are coloured orange. 
Extended Data Figure 4 | PAAR proteins contain a Zn atom. a, X-ray 
fluorescence spectra of gp5(VCA0018)-VCA0105 and gp5(c1883)-c1882 
crystals and their cryoprotectant solutions. The excitation wavelength is 1.0 Å 
(~12.4 keV). The energies and peak heights for the atomic Zn Kα and Kβ lines 
are taken from ref. 34. b, c, Anomalous difference Fourier maps (magenta 
mesh) of both crystal structures contain only one non-noise peak 
corresponding to the Zn atom. The VCA0105 and c1882 maps are contoured at 
15.0 and 6.0 standard deviations above the mean, respectively. The 
corresponding noise level of the two maps is ~5.1 and ~4.5 standard 
deviations above the mean, respectively. 
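
The correspondence between the excitation wavelength and the photon energy quoted in this legend follows from E = hc/λ. The short Python sketch below performs the conversion; the function name is hypothetical, and the Zn K absorption edge value is a rounded tabulated figure added here only for comparison, not a number taken from this paper.

```python
HC_KEV_ANGSTROM = 12.398  # Planck constant times speed of light, in keV x angstrom (rounded)
ZN_K_EDGE_KEV = 9.659     # rounded tabulated Zn K absorption edge (assumption, for comparison)

def photon_energy_kev(wavelength_angstrom):
    """Convert an X-ray wavelength in angstroms to a photon energy in keV."""
    if wavelength_angstrom <= 0:
        raise ValueError("wavelength must be positive")
    return HC_KEV_ANGSTROM / wavelength_angstrom

energy = photon_energy_kev(1.0)
print(f"1.0 angstrom corresponds to {energy:.1f} keV")
print("above the Zn K edge:", energy > ZN_K_EDGE_KEV)
```

An excitation energy above the K edge is what allows the Zn fluorescence lines to be excited in the first place.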
Extended Data Figure 5 | Conserved features of PAAR proteins. 

a, WebLogo” sequence alignment of VCA0105 homologues identified with 
BLAST*®. GenBank accession numbers of protein sequences that were used to 
prepare this WebLogo diagram in order of their appearance in the BLAST 
alignment file: 15600876, 227811731, 153212840, 417818856, 262403304, 
258622274, 261212686, 260769543, 336125205, 417954065, 153801242, 
424808850, 399908303, 15601050, 153817580, 254284956, 444380739, 
307545498, 237731059, 197336227, 387815895, 59712519, 149377963, 
120556511, 359395272, 338997855, 209695798, 338998698, 126665853, 
440287404, 327412950, 145298557, 385332665, 87118978, 433087003, 
433201317, 433072538, 432504178, 222155056, 262169027, 26247740, 
145300690, 422833042, 432730965, 91210698, 432592619, 222156204, 
432758040, 218704924, 26247745, 222156197, 417084314, 50121038, 


WebLogo 3.3 


444375671, 253990116, 423141622, 416895592, 423207453, 417628384, 
331657463, 406676759, 251790158, 323491128, 421082032, 437829594, 
120555573, 433076042, 145301291, 269102718, 261823483, 343512222, 
343512225, 343515540, 343510421, 416895589, 145299403, 117619593, 
385873501, 425070534, 343512226, 343510889, 422021929, 229523366, 
262402814, 385870158, 411009627, 385869971, 258627229, 295418810, 
37525853, 261819862, 307133118, 432995158, 422803770, 253987746, 
253688569, 114319223, 268590359 and 261822686. The conserved PAAR 
motif and residues forming the Zn binding site are labelled. b, c, Pseudotrimeric 
organization of the three interacting PAAR motifs in VCA0105 and c1882, 
respectively. The buried hydrogen bonds stabilizing the fold are shown with 
dashed lines. 
Extended Data Figure 6 | Superposition of VCA0105 and c1882 PAAR structures. Residue numbers are given at strategic positions. 
Extended Data Figure 7 | Quantification of Hcp secretion. Hcp secretion 
was evaluated by quantifying the intensity of the Hcp band in SDS-PAGE using 
the Fiji software’ and normalizing it to the wild type. The presented data are 
the average of three experiments. The error bars correspond to one standard 
deviation. The mutants are labelled as in Fig. 2. 
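
The normalization described in this legend can be sketched in a few lines of Python. The band intensities below are hypothetical, the function name is illustrative, and the use of the sample standard deviation (n - 1 denominator) is an assumption, because the legend does not state which estimator was used.

```python
from statistics import mean, stdev

def normalized_secretion(mutant_intensities, wt_intensities):
    """Normalize each mutant band to the wild-type band of the same experiment.

    Returns the mean and the sample standard deviation of the per-experiment ratios.
    """
    if len(mutant_intensities) != len(wt_intensities) or len(wt_intensities) < 2:
        raise ValueError("need at least two paired measurements")
    ratios = [m / w for m, w in zip(mutant_intensities, wt_intensities)]
    return mean(ratios), stdev(ratios)

# Hypothetical Hcp band intensities (arbitrary units) from three experiments.
wild_type = [1000.0, 1200.0, 900.0]
triple_mutant = [100.0, 150.0, 80.0]

avg, sd = normalized_secretion(triple_mutant, wild_type)
print(f"relative Hcp secretion: {avg:.2f} +/- {sd:.2f} (wild type = 1)")
```

A relative value near 0.1 would correspond to the roughly 90% reduction in secretion discussed in the main text.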
Extended Data Figure 8 | Complementation analysis of heterologous PAAR 
proteins. a, SDS-PAGE assay for Hcp secretion of the V. cholerae 2740-80 
2×P⁻ mutant described in the Fig. 2 legend carrying a plasmid expressing 
endogenous or heterologous PAAR proteins. Plasmids pV105, pA0052, and 
pA2681 express PAAR genes vca0105 (Vibrio), aciad0052 (Acinetobacter) and 
aciad2681 (Acinetobacter), respectively, under the control of the 
arabinose-inducible pBAD promoter”. N.P. stands for no plasmid. Cells were grown in 
the presence of 0.001%, 0.01% and 0.1% arabinose for 4 h. Both panels show a 
typical representative of three identical experiments. b, E. coli MG1655 killing 
by the V. cholerae 2740-80 2×P⁻ mutant that carries a plasmid expressing 
endogenous and heterologous PAAR proteins. The complementation 
efficiency is calculated as the log₁₀ ratio of E. coli recovered after competition 
with the wild type strain of V. cholerae 2740-80 and its 2×P⁻ mutant 
supplemented with the indicated plasmid. The data represent the average of 
three experiments; error bars correspond to one standard deviation. 
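
A minimal Python sketch of this complementation efficiency is given below. The direction of the ratio (wild-type recovery divided by recovery with the complemented mutant) is an assumption, because the legend does not spell it out; the function name and the c.f.u. values are hypothetical.

```python
import math

def complementation_efficiency(prey_cfu_with_wt, prey_cfu_with_complemented_mutant):
    """log10 ratio of E. coli recovered after competition with the wild type
    versus the plasmid-carrying mutant (assumed direction: wild type / mutant).

    0 means killing is restored to wild-type levels; negative values mean
    that more prey survived with the mutant than with the wild type.
    """
    if prey_cfu_with_wt <= 0 or prey_cfu_with_complemented_mutant <= 0:
        raise ValueError("c.f.u. values must be positive")
    return math.log10(prey_cfu_with_wt / prey_cfu_with_complemented_mutant)

# Hypothetical recoveries of E. coli (c.f.u.):
print(complementation_efficiency(1e4, 1e4))  # full complementation
print(complementation_efficiency(1e4, 1e6))  # 100-fold more prey survive
```

Under the assumed direction, a mutant that is not complemented at all would score close to the negative log₁₀ of the killing defect reported in Fig. 2.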
Extended Data Figure 9 | Bioinformatic analysis of PAAR proteins. a, Domain organization of all known bacterial (non-phage) PAAR proteins. b, Relative 
abundance of the seven distinct domain organizations. c, Predicted functions of the C-terminal domains. TTR stands for transthyretin domain. 
Extended Data Table 1 | Crystallographic data collection and refinement statistics. 
Both data sets were collected using a single crystal. *Highest resolution shell is shown in parentheses. 
An Sp1 transcription factor coordinates caspase- 
dependent and -independent apoptotic pathways 


Takashi Hirose1 & H. Robert Horvitz1 


During animal development, the proper regulation of apoptosis 
requires the precise spatial and temporal execution of cell-death 
programs, which can include both caspase-dependent and caspase- 
independent pathways'’. Although the mechanisms of caspase- 
dependent and -independent cell killing have been examined 
extensively, how these pathways are coordinated within a single cell 
that is fated to die is unknown. Here we show that the Caenorhabditis 
elegans Sp1 transcription factor SPTF-3 specifies the programmed 
cell deaths of at least two cells—the sisters of the pharyngeal M4 
motor neuron and the AQR sensory neuron—by transcriptionally 
activating both caspase-dependent and -independent apoptotic path- 
ways. SPTF-3 directly drives the transcription of the gene egl-1, which 
encodes a BH3-only protein that promotes apoptosis through the 
activation of the CED-3 caspase’. In addition, SPTF-3 directly drives 
the transcription of the AMP-activated protein kinase-related gene 
pig-1, which encodes a protein kinase and functions in apoptosis of 
the M4 sister and AQR sister independently of the pathway that acti- 
vates CED-3 (refs 4, 5). Thus, a single transcription factor controls two 
distinct cell-killing programs that act in parallel to drive apoptosis. 
Our findings reveal a bivalent regulatory node for caspase-dependent 
and -independent pathways in the regulation of cell-type-specific 
apoptosis. We propose that such nodes might act as features of a 
general mechanism for regulating cell-type-specific apoptosis and 
could be therapeutic targets for diseases involving the dysregulation 
of apoptosis through multiple cell-killing mechanisms. 

The C. elegans pharyngeal M4 motor neuron is generated during 
embryonic development, whereas the M4 sister cell dies by apoptosis 
soon after its generation (Fig. 1a)*’. We constructed a Pceh-28::gfp 
reporter transgene that expresses green fluorescent protein (GFP) spe- 
cifically in the M4 neuron of wild-type animals and in both the M4 
neuron and the surviving M4 sister of ced-3 caspase mutants defective in 
apoptosis (Fig. 1b)*, allowing us to identify efficiently mutants defective 
in M4 sister cell death from a large-scale genetic screen. Among our 
isolates were two non-allelic mutations—n4850 and n4780—which, on 
the basis of mapping and transformation-rescue studies, are alleles of 
the genes sptf-3 and pig-1, respectively (Fig. 1b). 

sptf-3 encodes an Sp1 family transcription factor with a character- 
istic glutamine-rich domain and three C2H2-type zinc finger domains 
(Fig. 1c). The n4850 mutant has a single sptf-3 mutation, at a splice 
acceptor site of the last exon (Fig. 1c). In total, 34% of sptf-3(n4850) 
mutants had a surviving M4 sister, and this cell-death defect was res- 
cued by a transgene carrying only the sptf-3 genomic locus (Fig. 1e). A 
deletion allele of sptf-3, tm607Δ, and inactivation of sptf-3 by RNA 
interference (RNAi) both phenocopied the sptf-3(n4850) mutation, 
demonstrating that a reduction of sptf-3 function causes a defect in 
M4 sister cell death (Fig. 1e). 

pig-1 encodes an AMP-activated protein kinase (AMPK)-related 
protein kinase most similar to mammalian maternal embryonic leu- 
cine zipper kinase (MELK); pig-1 is known to regulate the asymmetric 
cell divisions of several neuroblasts that divide to produce an apoptotic 
cell, including the M4 sister* (Fig. 1d). The n4780 mutant has a single 
pig-1 mutation in the kinase domain, changing a conserved glycine at 
amino acid 172 to glutamic acid (Fig. 1d). In total, 20% of pig-1(n4780) 
mutants had a surviving M4 sister, and this cell-death defect was res- 
cued by a transgene carrying only the pig-1 genomic locus (Fig. 1e). A 
presumptive null allele of pig-1, gm344Δ, and inactivation of pig-1 by 
RNAi both phenocopied the pig-1(n4780) mutation, demonstrating 
that a reduction of pig-1 function causes a defect in M4 sister cell death 
(Fig. 1e). 

Both sptf-3 and pig-1 are required for the deaths of multiple cells, 
including the sisters of the AQR neuron, the pharyngeal gland cells 1A 
(g1A) and the pharyngeal I2 interneurons (Fig. 1f). By contrast, neither 
sptf-3(n4850) nor pig-1(n4780) affected the deaths of the sisters of 
the pharyngeal neurosecretory motor (NSM) neurons, the sisters of the 
pharyngeal I1 interneurons or the ventral C (VC) homologues of the 
ventral nerve cord (Fig. 1f). Thus, sptf-3 and pig-1 seem to promote 
apoptosis in the same subset of cells fated to die, suggesting that sptf-3 
and pig-1 have a functional interaction in the regulation of cell death. 
Figure 1 | sptf-3 and pig-1 promote the death of the M4 sister cell. 
a, Schematic representation of the M4 cell lineage in the wild type and mutants 
defective in M4 sister cell death. b, Merged epifluorescence and Nomarski 
images of the pharynx in wild-type, ced-3(n717), sptf-3(n4850) and pig-1(n4780) 
animals expressing Pceh-28::gfp. Scale bar, 20 µm. c, d, Genomic organizations 
and protein structures of sptf-3 (c) and pig-1 (d), including the mutations n4850 
and n4780. The yellow and green boxes represent the Q-rich and C2H2-type 
zinc finger domains of SPTF-3, respectively. The blue box indicates the kinase 
domain of PIG-1. e, The percentages of M4 sister survival in animals of the 
indicated genotypes. f, The percentages of survival of the indicated cells and the 
number of extra VC homologues are shown. Errors are s.d. *Data from ref. 8. 
To identify direct transcriptional targets of SPTF-3 involved in the 
regulation of M4 sister cell death, we performed chromatin immuno- 
precipitation with massively parallel DNA sequencing (ChIP-seq) 
analyses using two different SPTF-3 polyclonal antibodies, N81 and 
M82, both of which specifically recognized the SPTF-3 protein (Sup- 
plementary Fig. 1). These experiments identified 2,459 genomic 
regions that immunoprecipitated with both antibodies (Supplemen- 
tary Fig. 2a-d and Supplementary Tables 1, 2). Gene ontology analysis 
indicated that SPTF-3 functions in a variety of biological processes 
(Supplementary Fig. 2e), consistent with the observation that sptf- 
3(n4850), the sptf-3(tm607Δ) deletion and sptf-3 RNAi knockdown 
all cause cell-fate transformations, embryonic and larval lethality 
and morphological abnormalities (Supplementary Fig. 3 and Sup- 
plementary Fig. 4)’. 

We identified an SPTF-3-bound region immediately upstream of 
the pig-1 coding region (Fig. 2a). This region contains the consensus 
SPTF-3 binding motif (CGCCC) identified from our ChIP-seq ana- 
lyses (Fig. 2b, c). We tested whether the SPTF-3 binding motif of the 
pig-1 promoter region is necessary for pig-1 to promote M4 sister cell 
death. A wild-type pig-1 transgene rescued the M4 sister cell-death 
defect of pig-1(n4780) mutants, whereas neither a pig-1 transgene 
lacking 71 base pairs of the SPTF-3-bound region of the pig-1 pro- 
moter (Δ71 bp) nor a pig-1 transgene containing mutations in the 
consensus SPTF-3 binding motif (mut.1) rescued the M4 sister cell- 
death defect of pig-1(n4780) mutants (Fig. 2d, e). The wild-type pig-1 
promoter expressed GFP in many embryonic cells, whereas mutant 
pig-1 promoters lacking the consensus SPTF-3 binding motif (Δ71 bp 
and mut.1) did not (Supplementary Fig. 5), indicating that the con- 
sensus SPTF-3 binding motif of the pig-1 promoter region is required 
for pig-1 expression. Furthermore, pig-1 transcript levels in sptf-3(n4850) 
mutants were decreased by 43% compared to those of wild-type ani- 
mals (Fig. 2f), and expression of a Ppig-1::gfp transgene was frequently 
absent from the seam cells, P cells, ALM neurons and AVM neuron 
of sptf-3(n4850) mutants (Fig. 2h and Supplementary Fig. 6). Con- 
versely, overexpression of sptf-3 from a multi-copy transgene under 
the control of the sptf-3 promoter induced ectopic expression of pig-1 
in the seam cells and the hyp7 hypodermal cells (Fig. 2g, h). These results 
indicate that the consensus SPTF-3 binding motif of the pig-1 promoter 
region is required for pig-1 to promote M4 sister cell death and that 
SPTF-3 is necessary and sufficient for pig-1 expression, suggesting that 
SPTF-3 directly drives pig-1 expression in the regulation of M4 sister cell 
death. 

Although SPTF-3 acts through pig-1 to promote M4 sister cell death, 
our genetic observations suggested that sptf-3 also functions through a 
pathway distinct from that of pig-1. A partial loss-of-function muta- 
tion of sptf-3, n4850, caused a defect in M4 sister cell death more severe 
than that of the pig-1 null mutation gm344Δ (Fig. 1e). The M4 sister 
cell-death defect of sptf-3; pig-1 double mutants is much more severe 
than that of either single mutant (Supplementary Table 3). Therefore, 
we tested whether sptf-3 acts through the pro-apoptotic BH3-only 


Figure 2 | pig-1 is a direct 
transcriptional target of SPTF-3 in 
the regulation of the death of the 
M4 sister cell. a, ChIP-seq results 
obtained by immunoprecipitation 
with the antibodies anti-SPTF-3 N81, 
anti-SPTF-3 M82 or normal IgG and 
input chromatin are shown. Arrow, an 
enriched SPTF-3-bound region. b, The 
consensus SPTF-3 binding motif 
determined by ChIP-seq is 
represented. c, DNA sequence of 
gene egl-1, which functions in the caspase-dependent pathway of 
programmed cell death. egl-1 acts through the anti-apoptotic BCL2 
homologue ced-9, the pro-apoptotic APAF1 homologue ced-4, and the 
pro-apoptotic caspase gene ced-3 to drive most cell deaths during the 
development of C. elegans’. As egl-1 is required for M4 sister cell death*, 
we scored egl-1 expression in the surviving M4 sister of ced-3 mutants 
defective in apoptosis using a Pegl-1::gfp reporter transgene that 
expresses GFP under the control of the egl-1 promoter®. Pegl-1::gfp 
was expressed in the M4 sister (100% of animals; n = 119) but not in 
the M4 neuron (0% of animals; n = 119) in ced-3 mutants (Fig. 3a, b), 
indicating that egl-1 is not only required for the apoptosis of, but also 
is expressed in, the M4 sister. We observed that only 33% of sptf- 
3(tm607Δ); ced-3 animals expressed Pegl-1::gfp in the M4 sister, whereas 
100% of pig-1(gm344Δ); ced-3 animals expressed Pegl-1::gfp in the M4 
sister (Fig. 3a, b). Thus, sptf-3 but not pig-1 is necessary for normal egl-1 
expression in the M4 sister, suggesting that sptf-3 acts through both the 
egl-1-mediated apoptotic pathway and the pig-1-mediated apoptotic 
pathway, whereas pig-1 acts through a pathway distinct from that of 
egl-1 to promote M4 sister cell death. 

We next asked whether SPTF-3 directly or indirectly drives egl-1 
expression in the M4 sister by examining the egl-1 promoter region using 
our ChIP-seq data. We found a small but distinct SPTF-3 binding peak 
immediately upstream of an egl-1 coding region (2.77- or 2.96-fold enrich- 
ment compared to an input control in the ChIP-seq experiments using 
SPTF-3 antibody N81 or M82, respectively) (Supplementary Fig. 7a). This 
SPTF-3-bound region contains two tandem consensus SPTF-3 bind- 
ing motifs (GGGCGGGGCG) (Supplementary Fig. 7b). These results 
suggest that SPTF-3 binds to this region. As SPTF-3 regulates egl-1 
expression in a cell-type-specific manner (Fig. 3a and Supplementary 
Fig. 8) and whole embryos are used for ChIP-seq analyses, it is probable 
that SPTF-3 binds to the egl-1 promoter region in only a small number of 
cells, resulting in a relatively small binding peak. 
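
The relationship between the consensus motif (CGCCC) and the tandem sequence reported for the egl-1 promoter (GGGCGGGGCG) is easier to see when both DNA strands are scanned. The sketch below is only an illustration, not the MEME-ChIP/MAST procedure used in the paper: the function names `reverse_complement` and `find_motif` are hypothetical, matching is exact-string only, and no scoring or position weight matrix is involved.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def find_motif(seq: str, motif: str = "CGCCC") -> list[tuple[int, str]]:
    """List exact matches of `motif` on both strands of `seq`.

    Each hit is (0-based start on the given strand, strand), where "-"
    means the reverse complement of the motif was found in `seq`.
    Overlapping matches are reported.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif.upper()),
                            ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


# The 10-bp sequence quoted in the text for the egl-1 promoter.
EGL1_TANDEM = "GGGCGGGGCG"
print(find_motif(EGL1_TANDEM))  # [(0, '-'), (5, '-')]
```

Under these assumptions the 10-bp sequence reads as two back-to-back copies of GGGCG, the reverse complement of CGCCC, which is one way to interpret "two tandem consensus SPTF-3 binding motifs".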

To test whether the SPTF-3-bound region of the egl-1 promoter is 
important for egl-1 to promote M4 sister cell death, we introduced wild- 
type and mutant egl-1 transgenes into egl-1(n1084n3082) mutants 
defective in M4 sister cell death. A wild-type egl-1 transgene, pTH01, 
rescued the M4 sister cell-death defect of egl-1 mutants, whereas an egl-1 
transgene lacking 30 bp of the SPTF-3-bound region of the egl-1 pro- 
moter did not (Fig. 3c, d). To define further the egl-1 promoter region 
important for M4 sister cell death, we introduced a series of mutations 
into this 30-bp region (Fig. 3c). egl-1 transgenes containing muta- 
tions in the GC-rich sequence (mut.2), in the 9 bp next to the GC-rich 
sequence at the 5’ side (mut.1), in the 8 bp next to the GC-rich sequence 
at the 3’ side (mut.3) or in both the GC-rich sequence and the 8 bp next 
to the GC-rich sequence at the 3’ side (mut.4) all rescued the M4 sister 
cell-death defect of egl-1 mutants. By contrast, an egl-1 transgene con- 
taining mutations in both the GC-rich sequence and the 9 bp next to the 
GC-rich sequence at the 5’ side (mut.5) did not rescue the M4 sister cell- 
death defect of egl-1 mutants (Fig. 3d). These results indicate that an 
egl-1 promoter sequence containing the consensus SPTF-3 binding 
motif is required for egl-1 to promote M4 sister cell death, suggesting 
that SPTF-3 directly drives egl-1 expression in the M4 sister. 

To further test the hypothesis that SPTF-3 but not PIG-1 functions 
through egl-1 in the regulation of M4 sister cell death, we performed 
epistasis analyses between sptf-3 or pig-1 and ced-9, which functions 
downstream of egl-1. Because the ced-9(n2812) null mutation causes 
ectopic cell deaths and organismal lethality, we used a weak ced- 
3(n2446) mutation in these experiments to suppress ced-9(n2812) leth- 
ality (Fig. 3e)'°. sptf-3; ced-9 double mutants had nearly the same 
penetrance of M4 sister survival as that of either single mutant 
(Fig. 3e). By contrast, ced-9; pig-1 double mutants were much more 
highly penetrant for M4 sister survival than either single mutant 
(Fig. 3e). These results indicate that sptf-3 and ced-9 act in a linear 
pathway and that pig-1 and ced-9, and hence pig-1 and the caspase gene 
ced-3 (considering that ced-9 acts by regulating ced-3) act in parallel in 
the regulation of the M4 sister cell death, consistent with previous 
studies that showed that pig-1 regulates other apoptotic deaths inde- 
pendently of ced-3 (refs 4, 5). The C. elegans genome encodes three 
additional caspase genes: csp-1, csp-2 and csp-3. We observed that csp- 
3; csp-1; csp-2 triple mutants were not defective in M4 sister cell death 
(0% of M4 sister survival; n = 120). Because pig-1 mutants are defect- 
ive in M4 sister cell death, we conclude that pig-1 promotes apoptosis 
of the M4 sister independently of csp-1, csp-2 and csp-3 and hence 
through a caspase-independent pathway. 

We next tested whether sptf-3 acts through pig-1 and egl-1 to pro- 
mote apoptosis of other cells, namely the AQR sisters, g1A sisters and 
I2 sisters, all of which survive in sptf-3 and pig-1 mutants (Fig. 1f). The 
wild-type pig-1 transgene rescued the defect in apoptosis of the AQR 
sister, g1A sisters and I2 sisters of pig-1(n4780) mutants, whereas the 
pig-1 transgene containing mutations in the consensus SPTF-3 binding 
motif (mut.1) did not (Fig. 4a-c). Thus, the consensus SPTF-3 binding 
motif of the pig-1 promoter region is necessary for pig-1 to promote 
apoptosis of the AQR sister, g1A sisters and I2 sisters, suggesting that 
SPTF-3 acts through pig-1 to promote apoptosis of these cells. 


Figure 3 | sptf-3 directly drives egl-1 expression 
[Figure 3 panel labels: egl-1 transgenes (none, WT, Δ30 bp, mut.1 to mut.5), two transgenic lines (#1, #2) each, in egl-1(n1084 n3082) mutants; panel e tabulates genotype against M4 sister survival (%) and n.] 
M4 sister of the indicated genotypes. *Data from 
ref. 8. c, Representations of the egl-1 transgenes 
used in rescue experiments. WT, a wild-type egl-1 
transgene; Δ30 bp lacks 30 base pairs of the SPTF- 
3-bound region; mut.1 to mut.5, mutated as shown. 
Black box, egl-1-coding region. Red box, SPTF-3- 
bound region identified by ChIP-seq. Red letters, 
mutated bases. d, The percentages of M4 sister 
survival in animals of the indicated genotypes. 

e, The percentages of M4 sister survival in animals 
of the indicated genotypes. All strains were 
homozygous for ced-3(n2446). 


We also tested whether sptf-3 acts through egl-1 to promote apoptosis 
of these cells. Whereas the egl-1 transgene pTH01 rescued a defect in 
apoptosis of the AQR sister but not of the g1A sisters or I2 sisters in egl-1 
mutants, the egl-1 transgene pBC08 (which overlaps only in part with 
pTH01) rescued the defect in apoptosis of the g1A sisters and I2 sisters 
but not of the AQR sister of egl-1 mutants (Supplementary Fig. 9). Both 
of these egl-1 transgenes contain the SPTF-3 binding site required for 
M4 sister cell death. We therefore tested whether the SPTF-3 binding 
site is required for egl-1 to promote apoptosis of the AQR sister using 
pTH01 and of the g1A sisters and I2 sisters using pBC08. The wild-type 
pTH01 and pBC08 egl-1 transgenes rescued a defect in apoptosis of 
the AQR sister, g1A sisters and I2 sisters of egl-1 mutants (Fig. 4d-f). 
The mutant pTH01 egl-1 transgene (mut.5) containing mutations in 
the SPTF-3 binding site of the egl-1 promoter region failed to rescue the 
defect in apoptosis of the AQR sister of egl-1 mutants, whereas the 
mutant pBC08 egl-1 transgene (mut.5) rescued the defect in apoptosis 
of the g1A sisters and I2 sisters (Fig. 4d-f). These results indicate that sptf- 
3 promotes apoptosis of not only the M4 sister but also of the AQR sister 
by caspase-dependent and -independent mechanisms through the 
direct transcriptional activation of egl-1 and pig-1, respectively. 

We next determined how sptf-3 and pig-1 interact with other genes 
that specifically regulate M4 sister cell death. We previously reported 
that the C. elegans six family homeodomain protein CEH-34 and the 
eyes absent protein EYA-1 directly drive the transcription of egl-1 in 
the M4 sister to promote M4 sister cell-type-specific apoptosis®. sptf- 
3(n4850) synergistically enhanced the M4 sister cell-death defect of 
pig-1(gm344Δ) null mutants, whereas ceh-34(n4796) or eya-1(ok654Δ) 
only additively enhanced this defect (Supplementary Table 3). These 
results indicate that sptf-3, ceh-34 and eya-1 function in pathways 
independently of pig-1, and that sptf-3 probably controls egl-1 at a 
cis regulatory site distinct from that used by the CEH-34-EYA-1 com- 
plex in the regulation of M4 sister cell death. 
Figure 4 | SPTF-3 functions cell autonomously to promote apoptosis of the 
M4 sister and the AQR sister. a-f, The percentages of survival of the AQR 
sister (a, d), g1A sister (b, e) and I2 sister (c, f) in animals of the indicated 
genotypes. The pig-1(+) transgene carries the wild-type pig-1 genomic locus, 
and the pig-1 mut.1 transgene has mutations in the consensus SPTF-3 binding 
motif in the pig-1 promoter region, as described in Fig. 2d. The egl-1 transgene 
pTH01 contains 6.5 kilobases (kb) of the 5’ promoter of egl-1, the coding region 
and 2.2 kb 3’ of the stop codon of egl-1. The egl-1 transgene pBC08 
contains 1.1 kb of the 5’ promoter of egl-1, the coding region and 5.7 kb 3’ 
of the stop codon of egl-1. pTH01 mut.5 and pBC08 mut.5 have mutations in 
the consensus SPTF-3 binding motif of the egl-1 promoter region, as described 
in Fig. 3c. g, Diagram of the M4 cell lineage. The cells in red were scored to 
determine the cell division at which the sptf-3-containing extrachromosomal 
array was lost in a mosaic animal. Shown are numbers of animals carrying the 
array in the indicated cell of 56 mosaic animals observed. Green circles, 
blastomeres in which the array was present. Black circles, blastomeres in which 
the array was lost. h, Examples of mosaic animals carrying an 
extrachromosomal array in the indicated cells. +, extrachromosomal array 
present. —, extrachromosomal array absent. i-p, Nomarski and 
epifluorescence images of a strain expressing a gfp::sptf-3 transgene. Scale bar, 
10 µm. q, A model for the pathways that regulate M4 sister and AQR sister cell- 
specific programmed cell death. 
To identify the cellular site of sptf-3 action in the regulation of M4 sister 
cell death, we performed a genetic mosaic analysis (Fig. 4g, h). We gene- 
rated sptf-3(n4850) mutants carrying an extrachromosomal array con- 
taining an sptf-3-rescuing transgene marked with the cell-autonomous 
GFP reporters sur-5::gfp and unc-119::gfp (refs 11 and 12). Extrachromo- 
somal arrays in C. elegans are mitotically unstable, resulting in mosaic 
animals that randomly lose the array in some cell lineages. We observed 
56 mosaic animals that were not rescued for the defect in M4 sister cell 
death but that carried the array with the sptf-3-rescuing transgene. 
Among these 56 animals, we found two animals that retained the array 
in the blastomere MSpa, the great-great-great-grandmother of the M4 
sister (MSpaaaaap), but did not find any animals that retained the array in 
the blastomere MSpaa, the great-great-grandmother of the M4 sister 
(Fig. 4g, h), indicating that sptf-3 is required at or later than the stage 
of the blastomere MSpaa and seems to function cell autonomously to 
promote the death of the M4 sister. 
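
The kinship terms in the mosaic analysis follow directly from the C. elegans lineage names, in which each division appends one letter to the mother's name. The sketch below assumes that convention and simply counts appended letters; `generations_between` and `kinship_label` are hypothetical helper names, not part of any published tool.

```python
def generations_between(ancestor: str, descendant: str):
    """Number of cell divisions separating two lineage names.

    Assumes each division appends exactly one letter to the mother's
    name, so an ancestor's name is a strict prefix of its descendant's.
    Returns None if `ancestor` is not an ancestor of `descendant`.
    """
    if ancestor == descendant or not descendant.startswith(ancestor):
        return None
    return len(descendant) - len(ancestor)


def kinship_label(generations: int) -> str:
    """Turn a generation count into 'mother', 'grandmother', and so on."""
    if generations < 1:
        raise ValueError("generations must be at least 1")
    if generations == 1:
        return "mother"
    return "great-" * (generations - 2) + "grandmother"


M4_SISTER = "MSpaaaaap"
for blastomere in ("MSpa", "MSpaa"):
    n = generations_between(blastomere, M4_SISTER)
    print(blastomere, n, kinship_label(n))
```

With the names given in the text, MSpa lies five divisions above the M4 sister and MSpaa four, which agrees with the "great-great-great-grandmother" and "great-great-grandmother" labels used above.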

To determine the temporal and spatial expression pattern of SPTF-3, 
we generated a gfp::sptf-3 transgene that encodes functional SPTF-3, 
as this transgene rescued the M4 sister cell-death defect of sptf-3 
mutants (data not shown). GFP::SPTF-3 was expressed ubiquitously 
in embryos and early larval animals (Fig. 4i-k and Supplementary 
Fig. 10); its expression in the M4 neuron and g1A cells decreased 
during larval development (Fig. 4k, l and Supplementary Fig. 10a, b). 
GFP::SPTF-3 was strongly expressed in the seam cells and the hyp7 
cells (Supplementary Fig. 10c, d), consistent with the observation that 
overexpression of sptf-3 induced ectopic expression of pig-1 in these 
cells (Fig. 2g, h). GFP::SPTF-3 localized exclusively to nuclei, consistent 
with the presumed function of SPTF-3 as a transcription factor. We 
detected GFP::SPTF-3 expression in MSpaa in embryos (Fig. 4i, m) and 
in the M4 neuron at the first larval stage (Fig. 4k, o), indicating that the 
SPTF-3 expression pattern is consistent with our conclusion that sptf-3 
functions cell autonomously to promote the death of the M4 sister. 

Given the cell-autonomous function of sptf-3 and pig-1 (ref. 4), we 
tested whether expression of sptf-3 or pig-1 could induce apoptosis. 
Ectopic expression of the caspase CED-3 under the control of the mec-7 
promoter caused the PLM neurons to die, whereas similar expression of 
either SPTF-3 or PIG-1 failed to do so (Supplementary Fig. 11a). 
Overexpression of SPTF-3 from a multi-copy array under the control 
of the sptf-3 promoter did not result in ectopic apoptosis of the seam 
cells, M4 neuron, AQR neuron, g1A cells and I2 neurons (Supplemen- 
tary Fig. 11b, c), all of which normally survive. Consistent with these 
observations, overexpression of SPTF-3 under the control of the sptf-3 
promoter also did not cause ectopic expression of the pro-apoptotic 
BH3-only gene egl-1 (Supplementary Fig. 12) or embryonic or larval leth- 
ality (Supplementary Table 4). These results indicate that expres- 
sion of neither sptf-3 nor pig-1 is sufficient to promote apoptosis and 
suggest that other factors are probably required for sptf-3 or pig-1 to 
promote apoptosis. 

Our findings demonstrate that the apoptosis of the M4 sister cell is 
specified by at least two parallel pathways that function non-redundantly: 
(1) the caspase-dependent apoptotic pathway mediated by egl-1 and acti- 
vated through direct transcriptional regulation by two different inputs, by 
sptf-3 or by ceh-34 and eya-1; and (2) the caspase-independent apoptotic 
pathway mediated by pig-1 and activated through direct transcriptional 
regulation by sptf-3 (Fig. 4q). In this apoptotic regulatory network, a 
single transcription factor, SPTF-3, coordinates caspase-dependent and 
caspase-independent pathways to promote cell-type-specific apoptosis of 
the M4 sister and the AQR sister. 

Our discovery that there is a common regulatory node for caspase- 
dependent and caspase-independent apoptosis might identify a gene- 
ral mechanism for the regulation of cell-type-specific apoptosis and, 
if so, could have an important therapeutic impact, as both caspase- 
dependent and caspase-independent cell-death processes (such as 
necroptosis) have been implicated in diseases as diverse as glaucoma, 
heart attacks and neurodegeneration’*"'*. Our finding that there can be 
a common regulatory node for the two pathways reveals a new possible 
approach to therapeutic intervention. 


METHODS SUMMARY 


C. elegans strains were cultured at 20°C as described previously'®. The N2 strain 
was used as the wild type. The mutations, integrations and extrachromosomal 
arrays used are described in Methods. sptf-3(n4850) and pig-1(n4780) were isolated 
as described’. Single nucleotide polymorphisms were used to map sptf-3(n4850) 
and pig-1(n4780) to a 136-kb interval (linkage group I: 13,340,839-13,477,415) and 
a 650-kb interval (linkage group IV: 1-657,968), respectively’. For ChIP-seq 
experiments, ChIP was performed as described'*. The enriched DNA and input 
DNA were used to construct libraries for Illumina/Solexa DNA sequencing using 
HiSeq 2000. The reads were aligned with the C. elegans genome using the software 
Bowtie and analysed as described in Methods. All SPTF-3-bound regions are 
described in Supplementary Tables 1 and 2. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 

C. elegans strains and genetics. C. elegans strains were cultured at 20 °C as 
described previously. The N2 strain was used as the wild type. sptf-3(n4850) and 
pig-1(n4780) were isolated as described. Single nucleotide polymorphisms were used 
to map sptf-3(n4850) and pig-1(n4780) to a 136-kb interval (linkage group (LG) I: 
13,340,839-13,477,415) and a 650-kb interval (LG IV: 1-657,968), respectively. The 
following mutations, integrations and extrachromosomal arrays were used. LGI: sptf- 
3(n4850, tm607Δ), eya-1(ok654Δ), csp-3(n4872Δ), nIs177[Pceh-28::gfp, lin-15AB(+)], 
zdIs5[Pmec-4::gfp, lin-15AB(+)]. LGII: csp-1(n4967Δ), nIs343[Pegl-1::gfp, lin-15AB(+)]. 
LGIII: ced-9(n2812), nIs176[Pceh-28::gfp, lin-15AB(+)], nIs375[Pgcy-37::gfp, lin- 
15AB(+)], nIs390[Php.ssigfp, lin-15AB(+)], nIs579[sptf-3(+), Pmyo-3::mCherry]. 
LGIV: ced-3(n717, n2446), pig-1(n4780, gm344Δ), csp-2(n4871Δ), nIs175[Pceh-28::gfp, 
lin-15AB(+)], nIs578[sptf-3(+), Pmyo-3::mCherry], wIs78[ajm-1::gfp, scm::gfp, F58E10, 
unc-119(+)]. LGV: ceh-34(n4796). LGX: lin-15(n765), nIs106[Plin-11::gfp, lin- 
15AB(+)], nIs283[Pgcy-10::gfp, lin-15AB(+)], nIs349[Pceh-28::mCherry, lin-15AB(+)], 
nIs427[Ppnarrigfp, lin-15AB(+)], nIs429[Ppharsigfp, lin-15AB(+)], nIs431[gfp:: 
sptf-3], bcIs24[Ptph-1::gfp, lin-15AB(+)], nIs540[Ppig-1::gfp, rol-6(su1006)]. Unmapped 
integrations: nIs597[Ppig-1::gfp, Prab-3::mCherry, sur-5::mCherry, lin-15AB(+)], nIs601 
[Ppig-1 Δ71::gfp, Prab-3::mCherry, sur-5::mCherry, lin-15AB(+)], nIs605[Ppig-1 mut.1::gfp, 
Prab-3::mCherry, sur-5::mCherry, lin-15AB(+)]. Extrachromosomal arrays: nEx1453 
and nEx1454[sptf-3(+), Plin-44::gfp], nEx1684[sptf-3(+), Punc-119::gfp, sur-5::gfp], 
nEx1971, nEx1972, nEx2105, nEx2106, nEx2109, nEx2110, nEx2113 and nEx2114 
[pig-1(+), Plin-44::gfp], nEx1974 and nEx1976[pig-1 Δ71 bp, Plin-44::gfp], nEx1978, 
nEx1979, nEx2107, nEx2108, nEx2111, nEx2112, nEx2115 and nEx2116[pig-1 mut.1, 
Plin-44::gfp], nEx1998 and nEx1999[pTH01 Δ30 bp, Plin-44::gfp], nEx2000, nEx2001, 
nEx2101 and nEx2102[pTH01, Plin-44::gfp], nEx2002 and nEx2003[pTH01 mut.2, 
Plin-44::gfp], nEx2004 and nEx2005[pTH01 mut.1, Plin-44::gfp], nEx2006 and nEx2007 
[pTH01 mut.3, Plin-44::gfp], nEx2008, nEx2009, nEx2103 and nEx2104[pTH01 mut.5, 
Plin-44::gfp], nEx2010 and nEx2011[pTH01 mut.4, Plin-44::gfp], nEx2091, nEx2092, 
nEx2095, nEx2096, nEx2099 and nEx2100[pBC08, Plin-44::gfp], nEx2093, nEx2094, 
nEx2098 and nEx2099[pBC08 mut.5, Plin-44::gfp], nEx2117 and nEx2118[Pmec-7::sptf- 
3(+), Pmec-3::mCherry, rol-6(su1006)], nEx2119 and nEx2120[Pmec-7::pig-1(+), Pmec-3:: 
mCherry, rol-6(su1006)], nEx2121 and nEx2122[Pmec-7::ced-3(+), Pmec-3::mCherry, rol- 
6(su1006)]. 

Antibody production. Protein fragments corresponding to amino acids 1-79 and 
192-275 of SPTF-3 fused to glutathione S-transferase (GST) were expressed, 
purified using glutathione Sepharose 4B (Amersham Biosciences) and used to 
raise SPTF-3 N81 and SPTF-3 M82 antibodies, respectively. Antisera were gen- 
erated by Pocono Rabbit Farm and Laboratory. Polyclonal antibodies were affin- 
ity-purified using identical SPTF-3 fragments fused to maltose-binding protein 
(MBP) and coupled to Affigel 10 (Bio-Rad). 

ChIP-seq analysis and bioinformatic analyses. Chromatin immunoprecipita- 
tions were performed as described'*. Wild-type adults were grown on nematode 
growth medium (NGM) plates, and embryos were obtained by bleaching the 
gravid adults. Embryos were fixed in 2% formaldehyde for 30 min at room tem- 
perature (23 °C), washed once with 100 mM Tris (pH 7.5), twice with M9 buffer 
and once with FA buffer containing 50 mM HEPES/KOH (pH 7.5), 1 mM EDTA, 
1% Triton X-100, 0.1% sodium deoxycholate, 150 mM NaCl with protease inhi- 
bitors (Roche), and frozen at —80°C. In total, 1.7 ml of packed embryos were 
suspended in 3 ml of FA buffer and homogenized using a dounce homogenizer. 
The sample was then sonicated using a Branson Sonifier 450 equipped with a 
microtip in ice water 24 times at the following setting: output control 1.2 and duty 
cycle constant 30 s ON and 1 min OFF in each cycle. The sample was centrifuged at 
13,000g for 15 min at 4°C. The protein concentration of the embryonic extract 
was determined using a BCA Protein Assay kit (Thermo Scientific). Sarkosyl was 
added to the embryonic extract at a final concentration of 1%, and the embryonic 
extract was centrifuged at 13,000g for 5 min at 4 °C. Embryonic extract containing 
1.54 mg of protein was used as the input control. Embryonic extract containing 
15.4 mg of protein and 75 µg of an affinity-purified SPTF-3 N81 antibody, embry- 
onic extract containing 15.4 mg of protein and 54 µg of an affinity-purified SPTF-3 
M82 antibody or embryonic extract containing 6.4 mg of protein and 54 µg of 
normal IgG was incubated at 4 °C overnight for immunoprecipitation in 2 ml of 
FA buffer containing 1% sarkosyl and protease inhibitors. The precipitated immu- 
nocomplexes were collected with Dynabeads protein A (Invitrogen) and washed 
twice with FA buffer for 5 min, once with FA buffer containing 1M NaCl for 
5 min, once with FA buffer containing 500 mM NaCl for 10 min, once with TEL 
buffer containing 250mM LiCl, 1% NP-40, 1% sodium deoxycholate, 1mM 
EDTA and 10 mM Tris (pH 8.0) for 10 min and twice with TE buffer (pH 8.0) 
containing 10 mM Tris (pH 8.0) and 1 mM EDTA for 5 min. The immunocom- 
plexes were eluted twice with 300 µl TE (pH 8.0) containing 1% SDS, 250 mM 
NaCl and TE at 65 °C for 15 min. The eluted and input samples were incubated at 
65 °C overnight to reverse the crosslinking and then treated with 67 µg ml−1 of 
RNase A at room temperature for 1 h followed by 67 µg ml−1 of proteinase K at 
55 °C for 2 h. DNA was purified by two phenol-chloroform extractions, followed 
by precipitation with ethanol and re-suspension in 50 µl of H2O followed by 
Illumina/Solexa DNA sequencing using HiSeq 2000. 

ChIP-seq data analysis. Images acquired from an Illumina/Solexa sequencer were 
processed through the bundled Solexa image extraction pipeline, which identified 
polony positions, performed base-calling and generated QC statistics. ChIP- 
derived reads were aligned with the C. elegans genome, Wormbase WS190, using 
the software Bowtie. Only sequences that mapped uniquely to the genome with 
zero or one mismatch were used for further analysis. The enriched sites with 
Pvalue< 10° per antibody defined by ChIP-seq were identified as described”. 
All SPTF-3-bound regions are described in Supplementary Tables 1 and 2. 
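
The read-selection rule stated above (unique genomic placement, zero or one mismatch) can be illustrated with a minimal Python sketch. It is not Bowtie's interface and not the authors' pipeline: the `Alignment` record and the function name are hypothetical, and the input is assumed to list every reported alignment for each read.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical record for one reported alignment of one read."""
    read_id: str
    chrom: str
    pos: int
    mismatches: int


def keep_unique_low_mismatch(alignments, max_mismatches=1):
    """Keep reads that align to exactly one place with <= max_mismatches."""
    hits_by_read = {}
    for aln in alignments:
        hits_by_read.setdefault(aln.read_id, []).append(aln)

    kept = []
    for hits in hits_by_read.values():
        # A read is "unique" when only one alignment was reported for it.
        if len(hits) == 1 and hits[0].mismatches <= max_mismatches:
            kept.append(hits[0])
    return kept


if __name__ == "__main__":
    example = [
        Alignment("r1", "I", 100, 0),    # unique, perfect match: kept
        Alignment("r2", "II", 5, 1),     # maps twice: dropped
        Alignment("r2", "III", 9, 0),
        Alignment("r3", "X", 7, 2),      # too many mismatches: dropped
    ]
    print([a.read_id for a in keep_unique_low_mismatch(example)])
```

In practice an aligner would normally apply this filter itself; the sketch only makes the criterion explicit.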
SPTF-3 binding motif. The sequences for all regions precipitated with an SPTF-3 
M82 antibody were analysed using a motif-based sequence analysis tool for large 
DNA data sets, Multiple Expectation Maximization for Motif Elicitation (MEME)- 
ChIP” (http://meme.sdsc.edu/meme/cgi-bin/meme-chip.cgi) to identify SPTF-3 
binding-site motifs. Motifs between six and 30 nucleotides were considered, with a 
maximum of six motifs for a data set input. The motifs discovered by MEME-ChIP 
were input to MAST”! to determine consensus motifs from the MEME-ChIP 
output and to search a sequence database for sequences that match the motifs. 
Gene Ontology analysis. A total of 2,459 SPTF-3-enriched binding regions were 
identified in ChIP experiments using both the anti-SPTF-3 N81 antibody and anti- 
SPTF-3 M82 antibodies. Putative target genes with overlapping SPTF-3 binding 
sites located upstream (≤10 kb from the transcription start site) or within the
genes were identified, and these genes were subjected to a Gene Ontology enrich- 
ment analysis using GOstat” (http://gostat.wehi.edu.au/). The complete set of 
annotated genes was used as the background set of genes. 
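
The assignment of binding regions to putative target genes described above can be sketched as an interval-overlap test. This is an illustration only, under stated assumptions: the `Gene` record and function names are hypothetical, the transcription start site is taken to be the 5' end of the annotated gene, the upstream window is 10 kb, and all coordinates lie on a single chromosome.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene annotation on one chromosome."""
    name: str
    start: int   # leftmost coordinate
    end: int     # rightmost coordinate
    strand: str  # "+" or "-"


def target_window(gene, upstream=10_000):
    """Interval covering the gene body plus `upstream` bp 5' of its start site."""
    if gene.strand == "+":
        return max(0, gene.start - upstream), gene.end
    # On the minus strand the 5' end is the rightmost coordinate.
    return gene.start, gene.end + upstream


def putative_targets(peaks, genes, upstream=10_000):
    """Return names of genes whose window overlaps any (start, end) peak."""
    hits = []
    for gene in genes:
        lo, hi = target_window(gene, upstream)
        if any(p_start <= hi and p_end >= lo for p_start, p_end in peaks):
            hits.append(gene.name)
    return hits


if __name__ == "__main__":
    genes = [
        Gene("a", 20_000, 22_000, "+"),
        Gene("b", 50_000, 52_000, "-"),
        Gene("c", 90_000, 91_000, "+"),
    ]
    peaks = [(12_000, 12_500), (60_000, 60_500), (70_000, 70_100)]
    print(putative_targets(peaks, genes))
```

The resulting gene list would then serve as the test set for an enrichment tool such as GOstat, with all annotated genes as the background.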

Plasmid construction. The Pceh-28::gfp, Pceh-28::mCherry, Pgcy-10::gfp, Pegl-1::gfp and
wild-type egl-1 (pTH01 or pBC08) transgenes were described previously. The
sptf-3 transgene contained 2.0 kb of 5’ promoter, the coding region and 0.5 kb 3’ of 
the stop codon in the pGEM-T Easy vector. The gfp::sptf-3 transgene contained 
2.0 kb of the 5’ promoter of sptf-3, the gfp gene with synthetic introns, the coding 
region and 0.5 kb 3’ of the stop codon of sptf-3 in pRS426. The pig-1 genomic 
fragment containing 0.8 kb of 5’ promoter, the coding region and 0.6 kb 3’ of the 
pig-1 stop codon was cloned into pRS426, and the last intron was removed using 
two NruI restriction enzyme sites. The QuickChange II XL Site-Directed
Mutagenesis Kit (Stratagene) was used to generate the pig-1(Δ71 bp) construct
lacking the SPTF-3-bound region, the pig-1 mut.1 construct mutated in the consensus
SPTF-3 binding motif, the egl-1(Δ30 bp) construct and the egl-1 mut.1-mut.5
constructs. sptf-3 complementary DNA was isolated by RT-PCR. The sptf-3
cDNA fragment corresponding to amino acids 1-79 or 192-275 was cloned in 
pGEX-4T-3 (GE Healthcare Life Sciences) and pMAL-c2 (New England BioLabs) 
to express SPTF-3 protein fragments fused with GST or MBP, respectively. The 
phat-5 promoter sequence in pGD48, provided by J. Gaudet, was cloned in
pPD122.56 to generate the Pphat-5::gfp transgene. The Pflp-15::gfp transgene contained
2.4 kb of the 5’ promoter of flp-15 in pPD122.56. The Pgcy-37::gfp transgene
contained 1.1 kb of the 5’ promoter of gcy-37 in pPD122.56. For the Pmec-7::sptf-3,
Pmec-7::pig-1 and Pmec-7::ced-3 transgenes, cDNAs containing the entire coding
region of each gene were cloned in pPD96.41. The Ppig-1::gfp, Ppig-1(Δ71 bp)::gfp
and Ppig-1 mut.1::gfp transgenes contained 0.9 kb of the 5’ promoter of pig-1 in
pPD122.56. The specific primer sequences are available on request from the 
authors. 

Germline transformation. Germline transformation experiments were performed 
as described. The gfp or mCherry transgenes were injected at 50 or 100 µg ml⁻¹ into
lin-15(n765ts) or ced-3(n717) lin-15(n765ts) animals with 50 µg ml⁻¹ of pL15EK as a
co-injection marker. The sptf-3 transgene was injected at 10 µg ml⁻¹ into
sptf-3(n4850) animals with 50 µg ml⁻¹ of Plin-44::gfp as a co-injection marker to rescue
the defect in M4 sister cell death. For expression of sptf-3, the sptf-3 transgene was
injected at 50 µg ml⁻¹ into nIs540 animals with 5 µg ml⁻¹ of Pmyo-3::mCherry as a
co-injection marker.

Quantitative RT-PCR. Total RNA from wild-type sptf-3(n4850); nIs349 embryos 
was prepared using an RNeasy Mini kit (Qiagen). Reverse transcription and 
quantitative PCR were performed as described. The data presented were generated
from three PCR reactions, and rpl-26 mRNA levels were used for normalization.
The specific primer sequences are available on request from the authors.
RNAi analysis. Nucleotides 629-1,238 of the sptf-3 cDNA and nucleotides 694— 
2,112 of the pig-1 cDNA were cloned into the pBluescript II vector, respectively. 
Sense and antisense RNA molecules were synthesized using T3 and T7 RNA 
polymerase, respectively, and then annealed to generate double-stranded RNA. 
The double-stranded RNA was injected into the gonads of nIs175 adult hermaph- 
rodites, and their progeny were scored at the first larval stage for a defect in M4 
sister cell death. 

Mosaic analysis. Transgenic animals of genotype sptf-3(n4850); nIs349;
nEx1684[sptf-3(+), Punc-119::gfp, sur-5::gfp] were generated for the mosaic analysis
experiments. Fifty-six mosaic animals that carried the extrachromosomal array
but that were not rescued for the defect in M4 sister cell death were picked and 
observed using Nomarski optics and epifluorescence to determine the presence or 
absence of the extrachromosomal array as judged by GFP fluorescence in the 
following cells: M4, I5, M5, M1, g1AR, g2R, ccAR, ccPR, I3, g1P, I4, g1AL, g2L
and intestinal cells. The cell division at which each extrachromosomal array was 
lost was determined based on the cells that retained the array’. 

Analyses of defects in programmed cell deaths of specific cells. The pro- 
grammed cell deaths of specific cells were scored at the indicated stages of larval 
development using the following transgenes, which express GFP in specific cells: 
M4 sister cell death, nIs175, nIs176 or nIs177 at the L1 stage; g1A sister cell death,
nIs429 at the L1 stage; I2 sister cell death, nIs390 at the L4 stage; AQR sister cell
death, nIs375 at the L2 stage; NSM sister cell death, bcIs24 at the L1 stage; I1 sister
cell death, nIs283 at the L4 stage; VC homologue cell deaths, nIs106 at the L4 stage.
The number of the V-lineage-derived seam cells of young adult animals was scored
using wIs78, which expresses GFP in the seam cells. A fluorescence-equipped
compound microscope was used to score these programmed cell deaths. 
Identification of the M4 neuron and the MSpaa cell. The M4 neuron was 
identified on the basis of its distinctive position in the anterior pharynx. To identify 
the MSpaa cell, we traced its cell lineage starting from the four-cell-stage embryo, 
at which point it is possible to easily distinguish the four blastomeres ABa, ABp, P2 
and EMS (the progenitor of MSpaa). 

Epithelial junctions maintain tissue architecture by
directing planar spindle orientation 


Yu-ichiro Nakajima, Emily J. Meyer, Amanda Kroesen, Sean A. McKinney & Matthew C. Gibson


During epithelial cell proliferation, planar alignment of the mitotic 
spindle coordinates the local process of symmetric cell cleavage with 
the global maintenance of polarized tissue architecture’”. Although 
the disruption of planar spindle alignment is proposed to cause epi- 
thelial to mesenchymal transition and cancer®*, the in vivo mechan- 
isms regulating mitotic spindle orientation remain elusive. Here we 
demonstrate that the actomyosin cortex and the junction-localized 
neoplastic tumour suppressors Scribbled and Discs large 1 have 
essential roles in planar spindle alignment and thus the control of 
epithelial integrity in the Drosophila imaginal disc. We show that 
defective alignment of the mitotic spindle correlates with cell dela- 
mination and apoptotic death, and that blocking the death of mis- 
aligned cells is sufficient to drive the formation of basally localized 
tumour-like masses. These findings indicate a key role for junction- 
mediated spindle alignment in the maintenance of epithelial integ- 
rity, and also reveal a previously unknown cell-death-mediated 
tumour-suppressor function inherent in the polarized architecture 
of epithelia. 

Throughout the animal kingdom, mitotic division typically initiates 
with cell rounding driven by Rho-kinase (ROK)-dependent actomyo- 
sin contractility at the cell cortex’®. In polarized epithelia, this process 
is generally followed by planar alignment of the mitotic spindle and cell 
cleavage orthogonal to the plane of the epithelium’””"“*. These events 
effectively coordinate the geometry of chromosome segregation with 
the architecture of polarized cell-cell junctions, thereby ensuring con- 
tinuous epithelial integrity. Nevertheless, despite the critical nature of 

Figure 1 | Planar orientation of the mitotic spindle during wing disc
development. a, b, Images of mitotic spindles in xy (a) and xz (b) images. c, In
metaphase, MUD localized to the cell cortex and was enriched on the spindle
poles. Left panel is a merged image of CNN, PH3 to label mitotic chromosomes,

this function, the mechanisms that restrict spindle orientation to the
plane of the epithelium remain poorly understood, as do the precise 
consequences of spindle misalignment for epithelial organization. 

In order to uncover the mechanism guiding planar orientation of the
mitotic spindle in the Drosophila wing disc epithelium, we first analysed
spindle pole movements with a fluorescent centriolar marker,
Asterless-yellow fluorescent protein (ASL-YFP), or a Centrosomin-green
fluorescent protein (CNN-GFP) fusion protein (Fig. 1a and
Supplementary Fig. 1b, c). In interphase cells, centrioles marked with
ASL-YFP consistently localized in a super-apical position, well above
the plane of the adherens junctions or basolateral septate junctions
(Supplementary Fig. 1a, b). At prophase entry, centrosomes moved
basally (Fig. 1b and Supplementary Fig. 1c), and the mitotic spindle
aligned with the plane of the epithelium during metaphase (averaging
6.5 ± 4.9° from planar) and anaphase (4.6 ± 3.7° from planar; Fig. 1a,
b and Supplementary Fig. 1d, e). Indeed, mitotic centrosomes were
almost always constrained to the region of the cell cortex defined by
NRG-GFP accumulation at the septate junctions (n = 97 out of 100
metaphase figures; Supplementary Fig. 1d). Accordingly, metaphase
and anaphase spindle orientations were consistently within the maximal
limits set by the septate junction-delimited mitotic zone (estimated at
25.6 ± 1.3°, n = 91).

The mitotic spindle could achieve planar orientation through either 
an active mechanism involving spatial cues or, more simply, alignment 
with the cellular long axis (for example, Hertwig’s rule). Inconsistent 
with a long-axis orientation mechanism, however, both interphase and 

and MUD. Right panel shows MUD alone. d, e, Quantification of mitotic
spindle alignment in control (d) and mud RNAi (e) wing discs. n indicates the
number of spindles observed. Scale bars, 5 µm.

mitotic cells were more extended along their apicobasal axes (data not
shown; Supplementary Fig. 1f), suggesting that molecular cues guide
planar spindle alignment. MUD, a Drosophila orthologue of the vertebrate
spindle regulator nuclear mitotic apparatus protein (NuMA), controls
spindle orientation in neuroblasts and embryonic epithelia.
In wing discs, MUD was strongly enriched on the spindle poles during
mitosis (Fig. 1c and Supplementary Fig. 2a). To determine the role of
MUD in planar spindle orientation, we expressed an RNA interference
(RNAi) knockdown construct under the control of the wing-specific
driver nubbin-GAL4 (nub-GAL4>UAS-mudRNAi). In mud RNAi discs,
36.4% of the spindles exhibited aberrant planar orientation (n = 88;
Fig. 1d, e and Supplementary Fig. 2b, c), indicating that MUD functions
to position the spindle in the plane of the epithelium.

During asymmetric cell divisions in Drosophila and mammalian 
tissue culture, cues embedded in the actomyosin cortex direct spindle 
orientation through interaction with astral microtubules'*”’. In wing 
discs, we observed a pronounced accumulation of F-actin, phospho- 
myosin and the actin-binding protein moesin (MOE-GFP) at the 
cortex of mitotic cells (Fig. 2a, b)’. MOE is the sole Drosophila ezrin/ 
radixin/moesin (ERM) protein, and upon phospho-activation it cross-links
the plasma membrane to the actomyosin cortex to modulate cell
shape*’. Consistent with a potential role for MOE in both mitotic 
rounding and planar spindle orientation in the wing disc, phospho- 
MOE was enriched in mitotic cells, where it localized in a polarized 
manner to the basal cortex (Fig. 2c, d and Supplementary Fig. 3a). 

To test the role of the cell cortex in planar spindle alignment, we 
used cytochalasin D (CytoD) to disrupt actin polymerization and 
Y-27632 to disrupt ROK, which phospho-activates both myosin and 
MOE. Notably, following a 30-min incubation of live wing discs
with CytoD, mitotic spindles rotated orthogonal to the plane of the
epithelium (78.4% strongly misoriented, n = 111; Fig. 2e, f and
Supplementary Fig. 3b). Inhibition of ROK with Y-27632 caused similar
defects, with 80.8% of spindles becoming severely misoriented (n = 26;
Fig. 2e, g and Supplementary Fig. 3b). To confirm these results
genetically, we knocked down rok and Moe by RNAi. In discs expressing
rok RNAi (nub-GAL4>UAS-rokRNAi), 58.6% of spindles were misoriented
orthogonal to the plane of the epithelium, and another 19.5%
exhibited milder defects (n = 133; Fig. 2h, i and Supplementary Fig. 3c).
We also expressed rok RNAi in cell clones induced by the flp-out/GAL4

Figure 2 | The actomyosin cortex is required for planar spindle orientation.
a, F-actin accumulated at the cortex of mitotic cells. b, MOE-GFP localized to 
the cortex of mitotic cells in the presumptive wing pouch. c, d, Phospho-MOE 
(p-MOE) was specifically enriched in the basal cortex of mitotic cells. Panels in 
d show different planes according to the z-positions indicated in 

technique and observed similar abnormalities (Supplementary Fig. 3e).
Consistent with these effects, 28.6% of spindles from nub-GAL4>MoeRNAi discs
were strongly misoriented and 17.9% had milder abnormalities (n = 112; 
Fig. 2h, j and Supplementary Fig. 3c). These phenotypes were confirmed 
with independent constructs (Supplementary Fig. 3d), indicating that 
ROK and MOE function to control planar spindle orientation during 
mitosis. 

We next investigated how the plane of the epithelium is interpreted 
by mitotic cells. Septate junctions, which define the mitotic zone and
correlate with the positioning of the spindle poles (Fig. 1), correspond
to vertebrate tight junctions, and are composed of core components
including Coracle, Neurexin IV and Neuroglian, as well as the associated
neoplastic tumour suppressors Scribbled (SCRIB) and Discs large
1 (DLG, also known as DLG1) (Fig. 3a and Supplementary Fig. 4a-d).
Intriguingly, mitotic spindle poles localized within 100 nm of septate 
junctions in intact wing discs (Fig. 3b, c and Supplementary Fig. 4e). 
Further, extensive epithelial disorganization is observed in scrib and dlg 
mutant imaginal discs”, raising the possibility that their neoplastic phe- 
notypes partly reflect defects in the control of planar spindle alignment. 

To determine the requirements for SCRIB and DLG in mitotic spindle 
orientation, we first used RNAi to knock down scrib in developing wing 
discs (nub-GAL4>UAS-scribRNAi; Supplementary Fig. 4c, d). Unexpectedly,
given its role in the establishment of epithelial polarity, cells expressing
scrib RNAi exhibited no obvious apicobasal polarity defects in the
late third instar (Fig. 3d and Supplementary Fig. 5). At the same 
experimental time point, however, more than 40% of spindles showed 
abnormal planar orientation (Fig. 3d-f and Supplementary Fig. 6a). 
The frequency of spindle misorientation progressively increased dur- 
ing an abnormal developmental delay (Supplementary Fig. 6b, c). 
Consistent with cooperative activity of SCRIB and DLG, DLG local- 
ization became diffuse after scrib knockdown (Supplementary Fig. 7a), 
and we observed spindle orientation defects in discs expressing dlg 
RNAi (Fig. 3g and Supplementary Fig. 7b, c). To confirm these results, 
we generated mutant clones using the mosaic analysis with a repres- 
sible cell marker (MARCM) technique. Spindle orientation was nearly 
randomized in scrib1 mutant cells undergoing mitosis at the apical
epithelial surface (Fig. 3h, i and Supplementary Fig. 8a—c). This con- 
trasts with the tendency for the spindle to reorient perpendicular to the 
plane of the epithelium following disruption of actin or ROK (Fig. 2e-j), 

Figure 3 | SCRIB and DLG determine planar orientation of the mitotic
spindle. a, SCRIB-GFP localized to septate junctions, correlating with spindle
position. b, A rounded anaphase cell (yellow line) exhibits chromosome
segregation (arrows). c, Centrioles (arrows) localized in close proximity to
septate junctions (SJ, bracket) in a magnification of the red box from
b. d, e, nub-GAL4>UAS-scribRNAi wing discs exhibit misoriented spindles
(dotted line in e) despite normal epithelial architecture and normal apical


perhaps owing to defects in mitotic cell rounding in those manipulations 
(Supplementary Fig. 9). Taken together, these results show that SCRIB 
and DLG function as determinants of planar spindle orientation and 
raise the possibility that the neoplastic phenotypes associated with 
lesions in scrib or dlg partly result from spindle orientation defects 
(Fig. 3d, e and Supplementary Fig. 6). Hypothetically, a primary defect 
in spindle orientation could cause a secondary breakdown of junctional 
architecture, leading to what is effectively an epithelial-to-mesenchymal 
transition (EMT). 

Loss of tissue architecture is a hallmark of epithelial cancer’, and 
our results indicate that defects in planar spindle orientation could be a 
critical initiating event in neoplastic overgrowth of scrib and dlg mutant
discs (Supplementary Fig. 6c). However, misorientation of the spindle 
may not be sufficient to initiate tumorigenesis, as neoplastic phenotypes 
were not observed after mud or rok knockdown. Instead, in nub-GAL4>rokRNAi
discs and in clones expressing rok RNAi, basally localized
cells lost polarity markers, underwent apoptotic cell death, and
were removed from the epithelium, as previously observed in Moe
mutants (Fig. 4a, b and Supplementary Fig. 10a). To determine the
consequence of spindle misorientation, we established a custom selective 
plane illumination microscopy (SPIM) system. In nub-GAL4>rokRNAi
wing discs, the basal daughters of severely misoriented divisions exhib- 
ited rapid basal movements, consistent with their eventual delamination 
from the epithelium (Supplementary Fig. 10d-f and Supplementary 
Videos 1, 2). Further, basally delaminated cells frequently exhibited 
centrosomal accumulation of CNN-GFP in the absence of mitotic 
markers (Supplementary Fig. 10g-i), suggesting that these cells may 
have undergone abnormal divisions and delaminated from the epithe- 
lium before their programmed death. 

To block apoptosis of misaligned daughter cells in nub-GAL4>rokRNAi
discs, we co-expressed the caspase inhibitor p35. This resulted in a highly
disorganized epithelial structure when compared to nub-GAL4>rokRNAi
alone (Fig. 4c, d). Intriguingly, increased numbers of apoptotic cells were 
also observed following scrib knockdown (Supplementary Fig. 10b), 
and the neoplastic phenotype was again strongly enhanced by caspase 
inhibition (Supplementary Fig. 11a). Combined, these results suggest 
that abnormal epithelial cells arising from spindle misalignment are 
normally cleared by apoptotic cell death and thereby prevented from 
disrupting epithelial organization. 

To extend these findings as a model for EMT by aberrant spindle 
orientation, we knocked down rok in cell clones co-expressing UAS-p35. 

localization of ECAD. e, Magnification of the box in d. f, g, Quantification of
mitotic spindle alignments in scrib RNAi (f) and dlg RNAi (g) wing discs
compared with control data from Fig. 1d. h, i, Quantification of metaphase
spindle alignments in control and scrib1 MARCM clones. The red line shows
the median angular deviation. n indicates the number of spindles observed.
Scale bars, 5 µm (a, e-i) and 20 µm (d).

Figure 4 | In the absence of apoptosis, spindle misorientation induces EMT-like
effects. a, b, rok knockdown induced basal cell delamination, loss of the
Armadillo polarity marker (ARM), and frequent apoptosis. c, d, Suppression of
cell death with p35 enhanced tissue disorganization in rok RNAi wing discs.
e, Control clones expressing p35 were normally integrated in the epithelial
layer. By contrast, GFP+ cell clones expressing p35 with rok RNAi (f) or mud
RNAi (g, h) exhibited morphological and molecular characteristics of EMT.
h, Strong MMP1 expression and actin remodelling observed in a basal tumour-like
mass. i, Conceptual model for the contribution of spindle misorientation to
aberrant EMT. Scale bars, 20 µm.

These clones formed large disorganized masses of aberrant cells on the
basal side of the disc (n = 55 out of 55 discs, Fig. 4e, f). Intriguingly, the 
abnormal cells exhibited molecular characteristics of EMT”, including 
loss of E-cadherin, induction of Matrix metalloproteinase 1 (MMP1) 
and actin remodelling (Supplementary Fig. 12a, c, e). A similar EMT- 
like effect was observed in scrib RNAi clones co-expressing p35 (n = 40 
out of 61 discs; Supplementary Fig. 11b), as well as in scrib1 mutant cells
expressing UAS-p35 (Supplementary Fig. 11c). Finally, to more directly 
implicate spindle misorientation in this process, we used UAS-p35 to 
suppress the cell death normally observed in cells expressing mud 
RNAi (Supplementary Fig. 10c). This manipulation phenocopied the 
effects of disrupting rok or scrib (n = 33 out of 38 discs; Fig. 4g, h and 
Supplementary Fig. 12b, d, f), demonstrating that in the absence of 
corrective cell death, spindle misorientation is alone sufficient to induce 
many key features of EMT. Crucially, the inhibition of apoptosis in 
dying cells did not lead to the formation of basal tumour-like masses, 
indicating that the EMT-like effect cannot simply be attributed to 
forced cell survival (n = 1 out of 54 discs; Supplementary Fig. 11d). 
Combined, these results show that misorientation of the mitotic 
spindle is typically corrected by cell death, but can result in EMT under 
conditions where normal apoptotic mechanisms are compromised 
(Fig. 4i). 

Here we demonstrate that planar alignment of the mitotic spindle 
requires interactions between the mitotic apparatus, the actomyosin 
cortex and the septate-junction-localized scaffolding proteins SCRIB 
and DLG. Although SCRIB and DLG have long been implicated in 
the suppression of neoplastic tumours in Drosophila and humans****””, 
our results uncover a novel requirement for both proteins during 
alignment of the mitotic spindle to the plane of the epithelium 
(Fig. 3). By contrast, other potential tumour suppressors linked to cell 
polarity did not affect planar spindle orientation in the wing disc 
(Supplementary Table 1). On the basis of these results, we propose 
that aberrant cleavage plane orientation could be a significant contributor
to the loss of epithelial integrity in scrib or dlg mutant cells (Fig. 4i).

In addition to the above findings, our results demonstrate a critical 
function for planar orientation of the mitotic spindle in the mainten- 
ance of epithelial architecture. Consistent with this view, disruption of 
spindle orientation with rok or mud RNAi was sufficient to trigger 
delamination of basal daughter cells (Supplementary Fig. 10 and Sup- 
plementary Videos 1, 2). When combined with the suppression of cell 
death, these manipulations produced delaminated mesenchyme-like 
masses exhibiting some molecular characteristics of EMT (Fig. 4h and 
Supplementary Fig. 12). During the early steps of tumour progression 
in humans, sporadic mutant cells are thought to acquire a potential to 
outgrow normal tissue by escaping from the epithelial sheet through 
EMT or EMT-like events**°. On the basis of our findings, we propose 
that the deleterious effects of aberrant spindle alignment are typically 
corrected by apoptosis, and that suppression of this corrective mech- 
anism could be a common initial driver of epithelial dysplasia and 
tumorigenesis in vivo. 


METHODS SUMMARY 


Detailed information including the fly stocks, immunofluorescence, drug treat- 
ment, imaging and image analysis, and associated references, are included in 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS

Fly stocks and genetics. The following stocks were used: OreR for wild-type
control, A9-GAL4, UAS-cnn-GFP, nub-GAL4, His2Av-mRFP (monomeric red
fluorescent protein), UAS-Moe-GFP, UAS-p35, UAS-reaper, UAS-GFP, UAS-Dicer-2,
Asl-YFP, Nrg-GFP, scrib-GFP, hs-flp UAS-mCD8-GFP; Tub-GAL4
FRT82B Tub-GAL80 for MARCM clones, yw; FRT82B, yw; FRT82B scrib1/TM6C,
hs-flp; Act>y+>GAL4 UAS-GFP for FLP/FRT-mediated (FLP-out)
clones, UAS-mudRNAi (Bloomington 28074, Bloomington 35044), UAS-rokRNAi
(Bloomington 28797, Bloomington 34324), UAS-MoeRNAi (VDRC 37917, Bloomington
33936), UAS-scribRNAi (Bloomington 35748), UAS-dlgRNAi (Bloomington 33629,
Bloomington 25780) for RNAi knockdown experiments (VDRC and TRiP).
Larvae were collected at 25 °C. Clones of RNAi constructs or mutants were gen- 
erated with a 1-h heat shock at 48 h after egg lay. 

Immunofluorescence. The following antibodies and dyes were used: rabbit anti- 
phospho-histone H3 (1:1,000, Millipore), mouse anti-phospho-histone H3 (1:2,000, 
Millipore), mouse anti-γ-tubulin (1:1,000, Sigma), rat anti-α-tubulin (1:250, Serotec),
rabbit anti-phospho-MOE (1:100, Cell Signaling), rabbit anti-cleaved CASP3 (1:500,
Cell Signaling), guinea pig anti-CNN (1:1,000, F. Matsuzaki), mouse anti-MUD
(1:50, F. Matsuzaki), rabbit anti-SCRIB (1:5,000, C. Doe), rabbit anti-BAZ (1:500, 
F. Matsuzaki), mouse anti-DLG (1:500, DSHB), mouse anti-ARM (1:200, DSHB), 
rat anti-ECAD (1:25, DSHB), mouse anti-CORA (1:100, C615.16, DSHB), mouse 
anti-FAS3 (1:100, DSHB), mouse anti-MMP1 (1:100, 1:1:1 cocktail of 5H7B11, 
3A6B4, 3B8D12, DSHB), rabbit anti-aPKCζ C20 (1:1,000, Santa Cruz Biotechnology),
fluorescent secondary antibodies (1:500, Invitrogen), Alexa Phalloidin 488,
546 (1:500, Invitrogen) and Hoechst 33342 (2 µM, Thermo Scientific).

Drug treatments. CytoD (5 µM, Calbiochem) and Y-27632 (ref. 39) (1 mM, Calbiochem)
were diluted in PBS and wing discs were incubated for 30 min as described. These
concentrations do not disrupt wing epithelial morphology during the assay period. 
Imaging and image analysis. Confocal images were collected with a ×63 glycerol
or ×40 oil objective lens on a Leica SP5 AOBS confocal microscope system (Leica
Microsystems). To measure mitotic spindle orientation, images of mitotic spindles 
were taken by optical cross section (xz scan). The angle between the line perpen- 
dicular to the epithelium and the line through the spindle poles was measured with 
Fiji/ImageJ (Supplementary Fig. 1d). On the basis of the maximal outer limits
defined by the average length of the septate junctions (25.6 ± 1.3°, n = 91), we
scored spindles as abnormal when the angular deviation from planar (θ) was θ ≥ 30°.
Values 60° ≤ θ ≤ 90° were considered strong misorientation and values 30° ≤ θ < 60°
were considered mild. In Figs 1-3, prometaphase/metaphase cells are shown for the
quantification of mitotic spindle orientation. Spindle orientations in anaphase/
telophase cells are in Supplementary Information.
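
The scoring scheme above can be written out as a short Python sketch. It is an illustration, not the authors' Fiji/ImageJ procedure. The function names are hypothetical. The epithelial plane is assumed to run along the x axis of the xz section. The 30° and 60° cut-offs are taken from the text, and assigning exactly 60° to the strong class is an assumption.

```python
import math


def deviation_from_planar(pole_a, pole_b):
    """Angle in degrees (0-90) between the pole-to-pole axis and the x axis.

    pole_a and pole_b are (x, z) coordinates of the two spindle poles in an
    xz optical section whose x axis is assumed to lie along the epithelium.
    """
    dx = abs(pole_b[0] - pole_a[0])
    dz = abs(pole_b[1] - pole_a[1])
    return math.degrees(math.atan2(dz, dx))


def classify_spindle(theta):
    """Map an angular deviation from planar to normal, mild or strong."""
    if not 0 <= theta <= 90:
        raise ValueError("theta must lie between 0 and 90 degrees")
    if theta >= 60:
        return "strong"
    if theta >= 30:
        return "mild"
    return "normal"


if __name__ == "__main__":
    theta = deviation_from_planar((0.0, 0.0), (4.0, 0.5))
    print(round(theta, 1), classify_spindle(theta))
```

Taking absolute values of both coordinate differences folds every spindle axis into the 0-90° range, so the order in which the two poles are recorded does not matter.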

SPIM. Wing discs were embedded in 1% agarose in Drosophila Ringers’ solution. 
Embedded wing discs were then placed in a chamber filled with fly media and
imaged every minute for 2-3 h at approximately 21 °C using SPIM. Fifty Z-sections
with 1-µm spacing were collected with an exposure time of 27 ms using
488 nm and 561 nm laser excitation scanned sequentially and detected using a W
Plan-Apochromat ×40/1.0 NA objective. Further details can be found in
Supplementary Information. 

Co-crystal structure of a T-box riboswitch stem I
domain in complex with its cognate tRNA 


Jinwei Zhang & Adrian R. Ferré-D’Amaré


In Gram-positive bacteria, T-box riboswitches regulate the expres- 
sion of aminoacyl-tRNA synthetases and other proteins in res- 
ponse to fluctuating transfer RNA aminoacylation levels under 
various nutritional states’. T-boxes reside in the 5’-untranslated 
regions of the messenger RNAs they regulate, and consist of two 
conserved domains. Stem I contains the specifier trinucleotide that 
base pairs with the anticodon of cognate tRNA. 3’ to stem I is the 
antiterminator domain, which base pairs with the tRNA acceptor 
end and evaluates its aminoacylation state’. Despite high phylo- 
genetic conservation and widespread occurrence in pathogens, the 
structural basis of tRNA recognition** by this riboswitch remains 
ill defined. Here we demonstrate that the ~100-nucleotide T-box 
stem I is necessary and sufficient for specific, high-affinity (dissociation
constant (Kd) ~150 nM) tRNA binding, and report the structure
of Oceanobacillus iheyensis glyQ stem I in complex with its
cognate tRNA at 3.2 Å resolution. Stem I recognizes the overall
architecture of tRNA in addition to its anticodon, something 
accomplished by large ribonucleoproteins such as the ribosome, 
or proteins such as aminoacyl-tRNA synthetases’, but is unprece- 
dented for a compact mRNA domain. The C-shaped stem I cradles 
the L-shaped tRNA, forming an extended (1,604 Å²) intermolecular
interface. In addition to the specifier-anticodon interaction,
two interdigitated T-loops near the apex of stem I stack on the 
tRNA elbow in a manner analogous to those of the J11/12-J12/11 
motif of RNase P and the L1 stalk of the ribosomal E-site. Because
these ribonucleoproteins and T-boxes are unrelated, this strategy to 
recognize a universal tRNA feature probably evolved convergently. 
Mutually induced fit of stem I and the tRNA exploiting the intrinsic 
flexibility of tRNA and its conserved post-transcriptional modifi- 
cations results in high shape complementarity, which in addition to 
providing specificity and affinity, globally organizes the T-box to 
orchestrate tRNA-dependent transcription regulation. 

T-box riboswitches contain conserved stem I and antiterminator 
domains separated by a variable linker. In some T-boxes, this linker 
contains three stem-loops. In the tRNAGly-specific glyQ and glyQS
T-boxes, there is only one short stem-loop and a ~13-nucleotide 
single-stranded segment (Extended Data Fig. 1). We analysed the 
contribution of different T-box segments to tRNA binding by isother- 
mal titration calorimetry (ITC). Full-length Bacillus subtilis glyQS 
T-box, a 3’ truncation leaving only the antiterminator residues that 
base pair with the tRNA acceptor end, and an isolated stem I domain 
all bind tRNAGly with comparable affinity (Fig. 1a-c). Therefore, stem
I provides essentially all the binding energy to tRNA. Proximal or distal 
truncations of stem I abrogate binding (Extended Data Fig. 2). 
Mutagenesis shows that high-affinity binding is contingent on cognate 
specifier-anticodon base pairing (Extended Data Table 1). However,
stem I recognizes tRNA elements outside of the anticodon stem-loop
(ASL), because an isolated ASL derived from tRNAGly does not bind
appreciably (Extended Data Fig. 2). We conclude that stem I is neces- 
sary and sufficient for specific, high-affinity tRNA binding by the 
T-box riboswitch. 
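
The statement that the three constructs bind with comparable affinity can be made quantitative by converting each dissociation constant into a standard binding free energy, ΔG° = RT ln(Kd). The sketch below is illustrative only: it uses the Kd values listed in Extended Data Table 1 and the 293 K temperature stated there, and the function and variable names are ours rather than part of the published analysis.

```python
import math

R_KCAL = 1.987204e-3  # gas constant, kcal mol^-1 K^-1
TEMP_K = 293.0        # ITC temperature stated in Extended Data Table 1


def binding_free_energy(kd_molar, temp_k=TEMP_K):
    """Return the standard binding free energy (kcal/mol), dG = RT ln(Kd)."""
    return R_KCAL * temp_k * math.log(kd_molar)


# Kd values (micromolar) copied from Extended Data Table 1.
KD_MICROMOLAR = {
    "full-length T-box (14-182)": 0.209,
    "3' truncation (14-158)": 0.194,
    "isolated stem I (14-113)": 0.150,
}

free_energies = {
    name: binding_free_energy(kd * 1e-6) for name, kd in KD_MICROMOLAR.items()
}
# Difference between the weakest and tightest binder, in kcal/mol.
spread = max(free_energies.values()) - min(free_energies.values())

for name, dg in free_energies.items():
    print(f"{name}: dG = {dg:.2f} kcal/mol")
print(f"spread = {spread:.2f} kcal/mol")
```

Under these assumptions the three free energies differ by roughly 0.2 kcal/mol, which is smaller than RT at 293 K (about 0.58 kcal/mol) and consistent with stem I supplying essentially all of the binding energy.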

Figure 1 | Overall structure of the T-box stem I in complex with tRNA.
a, ITC analysis of tRNA binding by full-length glyQS T-box (residues 14-182).
b, Binding by a 3’-truncated T-box (residues 14-158). c, Binding by an isolated
T-box stem I (residues 14-113). d, Sequence and secondary structure of the
co-crystallized glyQ stem I and tRNAGly RNAs. Leontis-Westhof” symbols denote
non-canonical base pairs. Lines with embedded arrowheads denote chain
connectivity. The tRNA (shaded) is numbered conventionally (‘t’ precedes
tRNA residues). e, Cartoon of the complex structure. Colour-coding as in
d; segments altered to facilitate crystallization and the YbxF protein are in white.

Figure 2 | Interactions between stem I and tRNA. a, Specifier-anticodon
interaction. Dashed lines denote hydrogen bonds. Displacement of A86 forms a
pocket (dashed oval). b, The two interdigitated T-loops at the distal end of stem
I stack on the tRNA elbow. c, Stacking of the distal stem I base triple on the
tRNA elbow. Yellow dashed lines denote tRNA elbow base pairing. d, tRNA
residue tU20 flips out to stack with the stem I C64 ribose (van der Waals
surfaces of interacting residues shown). e, Mutagenesis and ITC analysis of
selected stem-I-tRNA interactions. Data are mean and s.e.m. (n = 2).

To understand how an RNA domain of only ~100 residues can
recognize a 76-nucleotide tRNA, we co-crystallized stem I from the
glyQ T-box riboswitch of Oceanobacillus iheyensis (Fig. 1d) with
tRNAGly and the bacterial K-turn binding protein YbxF, and solved
its structure at 3.2 Å resolution (Methods). Stem I folds into an irregular
helix of ~110 Å contour length that cradles tRNA (Fig. 1e). Several
RNA structural motifs are arranged linearly in stem I, consistent with
phylogenetic conservation’*"'. The proximal end of stem I contains a
K-turn’, which introduces a 120° bend into the helical path. YbxF
recognizes the K-turn as seen previously in its SAM-I riboswitch
complex’’. Immediately distal is the specifier trinucleotide, which base
pairs with the tRNA anticodon, as predicted genetically’. This is followed
by a loop E motif", with the extruded G20 forming an S-turn.
The helical trajectory bends ~65° at a dinucleotide bulge (residues
29-30), allowing stem I to track the tRNA. A bulge (but not its
sequence) is conserved at this location in T-box stem I domains"
(Extended Data Figs 1 and 3). The phylogenetically conserved secondary
structure of the distal portion of stem I contains another bulge
followed by an apical loop (Extended Data Fig. 1). In our co-crystal
structure, this bulge and the loop each folds into a pentanucleotide 
T-loop’*. The two T-loops interdigitate and recognize the structurally 
conserved elbow of tRNA. Association with tRNAGly buries 1,604 Å²
of the solvent-accessible surface area of stem I (Extended Data Fig. 3).
Of these, 939 Å² are at the specifier-anticodon interaction and three
adjacent interfaces where the backbones of loop E and two stem I
segments on the same helical face contact backbone atoms of the
tRNA anticodon stem. The remaining 665 Å² are at the apical region
of stem I, which contacts two D-loop residues and one T-loop residue 
of the tRNA. The solvent-accessible area of tRNA buried by the T-box 
is comparable to those buried by several aminoacyl-tRNA synthetases 
(ARSs)°, but much larger than the ligand surface areas buried by most 
riboswitches"®. 

T-boxes have a conserved purine'' immediately 3’ to the three 
specifier nucleotides. Our structure reveals that this nucleotide 
(A90) stacks underneath the wobble base pair (C89•tG34; tRNA residue
numbers are preceded by ‘t’) and also hydrogen bonds to A16
(Fig. 2a). A similar stabilizing interaction occurs in the P-site of the 
ribosome, where the universally conserved C1400 of 16S ribosomal 
RNA stacks underneath the tRNA-mRNA wobble pair”’’. Near-symmetrically
to A90, tA37 stacks on top of the specifier-anticodon
duplex. A purine typically occupies tRNA position t37 and is often
post-transcriptionally modified into a larger nucleobase. The nucleotide
preceding the first specifier nucleotide of stem I (A86) is unstacked
from the specifier-anticodon duplex because of its participation in the
sheared A•A pair that initiates the loop E motif of stem I. This unstacking
produces a pocket that could accommodate large, modified t37
nucleobases (Fig. 2a and Extended Data Fig. 4).

Figure 3 | Induced fit of tRNA by stem I binding. a, Superposition of ASL of
free tRNAGly (Protein Data Bank (PDB) accession code 2LBJ; orange) with the
co-crystal structure. Steric clash between the free structure and the loop E motif
(green) is evident. b, Superposition of the ASL of free tRNAPhe (PDB code
1EHZ, orange) with the co-crystal structure. Note extrusion of tU33.
c, Comparison of stem-I-bound tRNAGly with free tRNAPhe (orange) seen from
the direction of the elbow. The approximate location of the hinge at the t26•t44
pair is indicated. d, Comparison of stem-I-bound tRNAGly with the
ribosome-bound P/P tRNA (PDB code 4GD2, orange).

The two interdigitated T-loops and adjacent nucleotides from the 
distal end of stem I together form a compact structural module con- 
sisting of six stacked layers (Extended Data Fig. 5). The most distal of 
these is formed by three co-planar bases, C44, A56 and G63. In the 
stem-I-tRNA complex, this base triple stacks on the conserved tRNA 
base pair tG19•tC56 (Fig. 2b). This conserved tertiary pair joins the
apices of the tRNA D- and T-loops and forms the ‘elbow’ of all elongator
tRNAs. Thus, the two T-loops of stem I stack on the T- and
D-loops of tRNA using the flat molecular surfaces formed by two 
interlocked loops in each RNA (Fig. 2b, c). The nucleobase of tU20, 
another conserved residue from the D-loop of tRNAs, extends the 
interface by stacking on the ribose of stem I residue C64 (Fig. 2d)— 
itself part of a conserved base pair’’. Similar extrahelical stacking of t20 
has been observed in tRNA-ARS complexes’*””. Position t20 is
post-transcriptionally modified in most tRNAs to dihydrouridine, which
stabilizes the C2’-endo conformation of the nucleotide”, potentially 
enhancing stacking with the ribose of C64 in stem I. The functional 
importance of several stem-I-tRNA interactions revealed by our co- 
crystal analysis is supported by mutagenesis and ITC analysis (Fig. 2e). 
In particular, alterations to the specifier, the adjacent purine (A90), the 
core and the distal surface of the interdigitated T-loops are all strongly 
detrimental. 

Sequence analysis and modelling”'®’' have previously suggested 
that the two interdigitated T-loops of stem I fold and function similarly 
to the J11/12-J12/11 internal loops of RNase P (ref. 22), and the L1 
stalk in the E-site of the 50S ribosomal subunit”. Our co-crystal struc- 
ture confirms this. However, the stem I T-loops bind to the tRNA 
elbow in the opposite orientation from those of RNase P and the 
ribosome®’ (Extended Data Fig. 6). Because there is no evidence that 
23S rRNA, RNase P and T-boxes share an ancestor, the use of inter- 
digitated T-loops to recognize the tRNA elbow has probably evolved 
independently at least three times. 

Association with stem I distorts tRNA locally and globally. The 
tRNAGly ASL rearranges after T-box binding, probably to avoid steric
clash. Its open structure free in solution™ superimposes poorly on that
in the complex (Fig. 3a, b). Comparison with ribosome structures
reveals that, similar to tRNAs engaged in translation””*”’, the
stem-I-bound tRNAGly bends near the t26•t44 pair at the junction of the
anticodon and D stems. This bend resembles that of a P/P state tRNA
(~10° relative to the free tRNAPhe), but is more pronounced (~20°;
Fig. 3c, d). Thus, in addition to deforming the anticodon loop, the
T-box seems to exploit the intrinsically flexible t26•t44 hinge to shift
the tRNA elbow by ~11 Å, bringing it into contact with the distal base
triple of stem I (Extended Data Fig. 7).

Figure 4 | Stem I reorganization by tRNA binding. a, Superposition of an
isolated stem I distal fragment (PDB code 4JRC, black) and the complex. Clash
between the helix (partly shown as spheres) of the fragment and the tRNA is
relieved by flexing below the T-loops in the complex. b, Superposition of the
specifier free (PDB code 2KZL, orange) and bound to tRNA. Specifier
nucleotides rotate outward, and A90 is displaced. c, Superposition of an isolated
stem I proximal fragment (PDB code 2KZL, orange) and the complex. tRNA
binding induces bending of stem I (arrow), bringing its 3’ terminus ~50 Å
closer to the acceptor end of tRNA.
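
Bend angles such as the ~10° and ~20° values quoted above are commonly expressed as the angle between helical-axis vectors for the same arm of the molecule in two states, after the other arm has been superposed. The text does not describe how its angles were measured, so the following is only a minimal sketch of that generic calculation; the axis vectors are hypothetical placeholders, not values taken from the structure.

```python
import math


def angle_between(u, v):
    """Angle in degrees between two 3D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.degrees(math.acos(cos_theta))


# Hypothetical axes of one tRNA arm in the free and bound states, expressed
# in a frame where the other arm is already superposed. The bound axis is
# rotated by 20 degrees about z to mimic a hinge bend.
free_axis = (1.0, 0.0, 0.0)
theta = math.radians(20.0)
bound_axis = (math.cos(theta), math.sin(theta), 0.0)

bend = angle_between(free_axis, bound_axis)
print(f"bend = {bend:.1f} degrees")
```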

Stem I also undergoes pronounced conformational changes after 
tRNA binding. A structure” of an apical fragment of stem I shows 
good agreement of the interdigitated T-loops; however, a ~20° hinge 
motion proximal to the T-loops must occur concomitant with binding 
to avoid steric clash with the tRNA (Fig. 4a). Comparison with the 
solution structure”* of a proximal fragment of a tyrS T-box shows that 
specifier residues rotate on average 34° out of the minor groove to base 
pair with the anticodon. This rotation propagates, moving the functionally
crucial A90 by 4 Å so it stacks under the specifier-anticodon
duplex (Figs 2e and 4b). The propagation also results in continuous 
stacking of the phylogenetically conserved purines”” (A8-A19) on the 
5’ strand of stem I. Markedly different from the extended structure” 
exhibited in solution in the absence of tRNA, the proximal residues of 
stem I fold into a canonical K-turn in the tRNA complex. This results 
in a marked contraction of the distance separating the 3’-terminus of 
stem I from the acceptor end of tRNA (Fig. 4c). This distance needs to 
be bridged by the short linker separating the glyQS stem I from the 
antiterminator (Extended Data Fig. 1) for the T-box riboswitch to 
function. A role for the K-turn in organizing the large-scale architec- 
ture of the T-box-tRNA complex reconciles its reported requirement 
in vivo® and its lack of contribution to tRNA binding affinity in vitro 
(Extended Data Fig. 2 and Extended Data Table 1). Therefore, the 
mutually induced fit of stem I and tRNA not only gives rise to specific, 
high-affinity binding, but also organizes the overall architecture of the 
5'-untranslated region to orchestrate the aminoacylation-dependent 
transcriptional termination decision. 

Despite having different sequences and being extensively modified 
post-transcriptionally, all elongator tRNAs share a common overall 
architecture as well as a flexible hinge that allows efficient transit
through the ribosome. Our T-box-tRNA co-crystal structure reveals 
how a gene-regulatory mRNA domain exploits all these molecular 
features of tRNA in addition to the anticodon sequence for specific 
recognition. Phylogenetic conservation indicates that T-boxes use a 
conserved structural scaffold to recognize divergent tRNAs. Our ana- 
lysis therefore provides a framework with which to understand the 
coevolution of interacting RNA molecules. 


METHODS SUMMARY 


O. iheyensis glyQ T-box stem I was modified for crystallization by deleting one base
pair from the distal side of the K-turn (above the A11•G95 pair) and introducing
three base pairs at its proximal end. O. iheyensis tRNAGly(GCC) was circularly
permuted and its acceptor stem was capped with a GAAA tetraloop (Fig. 1d). The two
RNAs and one equivalent of selenomethionyl B. subtilis YbxF (200 µM complex in
a buffer consisting of 50 mM HEPES-KOH, pH 7.0, 100 mM KCl, 20 mM MgCl₂
and 5 mM tris(2-carboxyethyl)phosphine (TCEP)) were co-crystallized at 21 °C
by vapour diffusion by mixing with an equal volume of a reservoir consisting of
50 mM Bis-Tris (HCl), pH 6.5, 300 mM Li₂SO₄ and 20% (w/v) PEG3350,
supplemented with 0.2% (w/v) low-melting-point agarose. Co-crystals with the
symmetry of space group C222₁ were incubated in a harvesting solution consisting of
50 mM Bis-Tris (HCl), pH 6.5, 100 mM KCl, 20-50 mM SrCl₂, 40-45% (w/v)
PEG3350 and 5 mM TCEP for 16 h before flash-freezing in liquid nitrogen.
Diffraction data were collected at 100 K at beamlines 5.0.1 and 5.0.2 of the
Advanced Light Source (ALS), and beamlines 24-ID-C and 24-ID-E of the
Advanced Photon Source (APS). Single wavelength anomalous dispersion
(SAD) phases from selenomethionine and Sr²⁺ ions were combined with
molecular replacement phases obtained using tRNA and a K-turn-YbxF complex as
search models to produce initial electron density maps. The structure, which
contains two stem-I-tRNAGly-YbxF ternary complexes per crystallographic
asymmetric unit, has been refined to Rfree = 25.2%, and has a mean coordinate
precision of 0.47 Å (Extended Data Table 2). The two ternary complexes in the
asymmetric unit superimpose closely. Biochemical and ITC analysis of the
T-box-tRNA interaction was performed using RNAs derived from the B. subtilis glyQS
sequence (Extended Data Fig. 1).
METHODS


RNA and protein preparation. Biochemical and ITC analysis of the T-box- 
tRNA interaction was performed using RNAs derived from the B. subtilis glyQS 
sequence (Extended Data Fig. 1). For crystallization, the O. iheyensis glyQ T-box 
stem I RNA was modified by deleting one base pair from the distal side of the 
K-turn (above the A11-G95 pair) and introducing three base pairs at its proximal 
end. The O. iheyensis tRNAGly(GCC) was circularly permuted, and its acceptor stem
was capped with a GAAA tetraloop (Fig. 1d). These alterations did not adversely 
affect the stem-I-tRNA association (Extended Data Fig. 2h and Extended Data 
Table 1). B. subtilis glyQS T-box and tRNAGly constructs, O. iheyensis glyQ T-box
stem I RNA (A89C), and O. iheyensis tRNAGly (Fig. 1d) were transcribed in vitro
using T7 RNA polymerase from double-stranded DNA templates generated by 
PCR, as described*’, except that 2 ml of PCR reaction product was used for each 
5-ml transcription. RNAs were purified by electrophoresis on 8% polyacrylamide, 
8M urea gels (29:1 acrylamide:bisacrylamide), electroeluted, washed once with 
1 M KCl, desalted by ultrafiltration, and stored at 4 °C. Selenomethionyl B. subtilis
YbxF was expressed and purified as described". 

ITC. To suppress oligomerization, tRNAGly was heated to 90 °C in water for 3 min
and cooled to 4 °C over 2 min immediately before use. RNA samples for ITC
experiments were equilibrated in a buffer consisting of 100 mM KCl, 50 mM
HEPES-KOH, pH 7.0, and 10 mM MgCl₂ by ultrafiltration. ITC experiments were
performed at least in duplicate at 20 °C with 20 µM T-box RNA in the cell and
200 µM tRNA or ASL in the syringe, using a MicroCal iTC200 microcalorimeter
(GE). Initial data analysis was performed as described** *’; then, data from replic- 
ate experiments were fitted globally***° using SEDPHAT and NITPIC to arrive at 
the dissociation constants and uncertainties in Extended Data Table 1. 
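
A common sanity check on titration conditions like these is the Wiseman c parameter, c = n[M]/Kd, where [M] is the macromolecule concentration in the cell and n is the binding stoichiometry. The sketch below assumes 1:1 binding (n = 1, which is our assumption) and combines the 20 µM cell concentration given above with two Kd values from Extended Data Table 1; the function name is ours.

```python
def wiseman_c(cell_conc_molar, kd_molar, n_sites=1.0):
    """Wiseman c parameter, c = n * [M]_cell / Kd (dimensionless)."""
    return n_sites * cell_conc_molar / kd_molar


CELL_CONC = 20e-6  # 20 uM T-box RNA in the cell (from the Methods text)

# Kd values (molar) taken from Extended Data Table 1.
kd_values = {
    "stem I 14-113 (wt)": 0.150e-6,
    "G63C": 8.7e-6,
}

c_values = {name: wiseman_c(CELL_CONC, kd) for name, kd in kd_values.items()}
for name, c in c_values.items():
    print(f"{name}: c = {c:.1f}")
```

With these inputs the wild-type stem I titration has c well above 100, whereas a weakly binding mutant such as G63C falls to a c of only a few, where isotherms are shallower and fitted Kd values are generally less well determined.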
Co-crystallization and diffraction data collection. Engineered O. iheyensis
tRNAGly (75 nucleotides) was heated to 90 °C in water for 3 min and cooled to
4 °C over 2 min, mixed with one equivalent of stem I RNA and incubated in the
presence of 50 mM HEPES-KOH, pH 7.0, 100 mM KCl, 20 mM MgCl₂ and 5 mM
tris(2-carboxyethyl)phosphine (TCEP) at 50 °C for 10 min and then at 37 °C for
30 min. One equivalent of selenomethionyl B. subtilis YbxF was then added. The
solution was adjusted to 200 µM complex, 2 mM spermine, 0.2% (w/v)
low-melting-point agarose and held at 37 °C. For crystallization, the complex was
mixed 1:1 with a reservoir solution consisting of 50 mM Bis-Tris (HCl), pH 6.5,
300 mM Li₂SO₄ and 20% PEG3350. Plate-shaped crystals grew in 1-4 weeks to
maximum dimensions of 300 × 300 × 50 µm³. After incubation in 50 mM
Bis-Tris (HCl), pH 6.5, 100 mM KCl, 20-50 mM SrCl₂, 40-45% PEG3350 and 5 mM
TCEP for 16 h, crystals were dissected out of the agarose using MicroSaws
(Mitegen), mounted on 90° bent MicroLoops (Mitegen), and vitrified by plunging
into liquid nitrogen. Co-crystals have the symmetry of space group C222₁. Unit
cell dimensions are in Extended Data Table 2. Single-wavelength anomalous 
dispersion (SAD) data were collected at the selenium K edge (0.9792 Å) at
100 K at beamlines 24-ID-C and 24-ID-E of the Advanced Photon Source
(APS), and indexed, integrated, and scaled using the NE-CAT RAPD pipeline 
that uses XDS*’ and Scala**. Additional data were collected at beamlines 5.0.1 
and 5.0.2 of the Advanced Light Source (ALS), and processed with HKL2000 
(ref. 39). Data collection statistics are summarized in Extended Data Table 2. 
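
The wavelength quoted for the selenium K edge can be related to photon energy through E = hc/λ. The short sketch below performs that conversion for the 0.9792 Å value given above; the constant is the standard value of hc in keV·Å and the function name is ours.

```python
HC_KEV_ANGSTROM = 12.398419843  # h * c expressed in keV * angstrom


def wavelength_to_energy_kev(wavelength_angstrom):
    """Convert an X-ray wavelength in angstroms to photon energy in keV."""
    return HC_KEV_ANGSTROM / wavelength_angstrom


se_edge_energy = wavelength_to_energy_kev(0.9792)
print(f"{se_edge_energy:.3f} keV")
```

The result, about 12.66 keV, is in line with the tabulated energy of the selenium K absorption edge, as expected for a SAD experiment on selenomethionyl protein.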
Structure determination and refinement. Four selenium sites were identified 
from crystal I data using SHELX-D (ref. 40) through HKL2MAP (ref. 41). The 
resulting SAD phases were combined with model phases resulting from placing 
two truncated search models each of tRNA“ (PDB code 3L0U) and K-turn bound 
by YbxF’? (PDB code 3V7E) using MOLREP”, in a strategy similar to that 
described**. Subsequently, the composite ensemble (two copies each of tRNA 
and the K-turn-YbxF complex) was located in a merged, high-redundancy data 
set (crystals I-II-III) using PHASER”, allowing completion of the anomalous 
atom substructure, which consists of eight selenium atom sites and three Sr 
ions. SAD (mean overall figure of merit = 0.36) and model phases were combined 
using the molecular replacement-SAD pipeline implemented in PHENIX” and 
density-modified using RESOLVE”, producing a substantially improved electron 
density map (Extended Data Fig. 8) with which manual model building could 
begin. Iterative rounds of manual model building’, molecular replacement, and 
phase-combination produced a near-complete model. This model was then sub- 
jected to additional manual building interspersed with iterative rounds of rigid- 
body, simulated-annealing, and individual isotropic B-factor refinement against 
crystal I data using PHENIX. Refinement statistics are summarized in Extended 
Data Table 2. The maximum likelihood coordinate precision of the current model
is 0.47 Å. Ramachandran analysis shows that 92.4% of the amino acid residues are
in the most favoured regions, 7.0% in allowed regions, and 0.6% (one residue) in
disallowed regions. The two stem I and tRNA molecules in the asymmetric unit
superimpose closely (root mean squared deviation (r.m.s.d.) of 1.1 and 0.7 Å,
respectively). In the crystals, the flat surface formed by the interdigitated 
T-loops on the face opposite the tRNA elbow stacks on both the apical nucleobase 
of the GAAA tetraloop of the engineered tRNA, and the terminal (most proximal) 
base pair of stem I from symmetry-related molecules. 

Comparisons were performed against the following structures: tRNA ASL 
(PDB code 2LBJ, ref. 24); tRNAPhe (PDB code 1EHZ, ref. 49); the 70S ribosome
with a P/P tRNA (PDB code 4GD2, ref. 26); a distal fragment of stem I (PDB code
4JRC, ref. 21); a proximal fragment of stem I (PDB code 2KZL, ref. 28); the loop E
motif of 5S rRNA (PDB code 354D, ref. 50); RNase P holoenzyme bound to tRNA 
(PDB code 3Q1Q, ref. 6); the L1 stalk of the 50S ribosomal subunit (PDB code 
1MZP, ref. 23); and the 70S ribosome with an E/E tRNA (PDB code 1VSA, ref. 7). 
The superposition shown in Fig. 4a used the interdigitated T-loops of the two 
structures. Those shown in Fig. 4b, c were generated by overlaying the loop E 
residues of the two structures. In both cases, r.m.s.d. was less than 1 Å. Solvent
accessible surface areas were calculated with a probe radius of 1.4 Å. Structural
figures were prepared with PyMol”, using chains A (YbxF), B (tRNAGly) and C
(glyQ stem I). 
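
The r.m.s.d. values quoted for the superpositions above follow the usual definition: the square root of the mean squared distance between matched atoms after the two structures have been overlaid. The sketch below shows only that final step, for coordinates assumed to be already superposed; it does not perform the superposition itself, and the coordinates are hypothetical placeholders rather than atoms from the deposited model.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean squared deviation between two equal-length lists of
    (x, y, z) coordinates that have already been superposed."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical positions (angstroms) for three matched atoms; each atom in
# the second set is displaced by exactly 1 angstrom from its partner.
reference = [(0.0, 0.0, 0.0), (5.9, 0.0, 0.0), (11.8, 0.0, 0.0)]
mobile = [(0.0, 1.0, 0.0), (5.9, 0.0, 1.0), (12.8, 0.0, 0.0)]

value = rmsd(reference, mobile)
print(f"r.m.s.d. = {value:.2f} angstroms")
```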
Extended Data Figure 1 | Sequence of the B. subtilis glyQS T-box. Unlike
some other T-boxes, tRNAGly-responsive T-boxes (such as the B. subtilis glyQS
T-box and the closely related O. iheyensis glyQ T-box) do not contain stems II
and IIA/B in the linker region. Conservation across known T-boxes is indicated
in orange and yellow circles (from ref. 11). Base pairing between the specifier
sequence in stem I and the antiterminator domain of the T-box and the
anticodon and acceptor end, respectively, of tRNAGly is indicated. The
functionally important K-turn® is boxed. The sequences of tRNAGly of B.
subtilis and O. iheyensis are identical.

Extended Data Figure 2 | Representative isothermal calorimetric titrations.
a-f, Full-length tRNAGly (a-e) and an isolated tRNAGly ASL (f) were titrated
into various truncated B. subtilis glyQS T-box stem I constructs (Methods). The
specifier trinucleotide GGC is in bold. g, Superposition of the experimental data
and fits for the experiments shown in Fig. 1a-c, colour-coded to correspond to
schematic RNA construct depictions. h, ITC analysis of binding by the
engineered tRNAGly construct used for co-crystallization. Results of all fits
(Methods) are in Extended Data Table 1.

Extended Data Figure 3 | Intermolecular interface of the stem-I-tRNA
complex and phylogenetic conservation. a, The residue 29-30 bulge allows
the O. iheyensis stem I to bend and track the tRNA backbone. b, Structure’! of
[...] blue or white (no burial) to red (>75 Å² per residue). d, Open-book view.
e, Surfaces coloured by phylogenetic conservation of stem I residues (see
Extended Data Fig. 1). Conservation of tRNA is not indicated. f, Open-book [...]

Extended Data Figure 4 | T-box can accommodate bulky post-transcriptional
modifications at tRNA position 37. A86 of the T-box stem I
is unstacked from the duplex formed by pairing of the tRNA anticodon and the
T-box specifier by its participation in the sheared A•A pair at the base of the
loop E motif. This results in a pocket (dotted line) that could accommodate the
large, modified nucleobases (for example, the three-ring heterocycle wybutine
in yeast tRNAPhe) often present at position 37 of tRNAs (Fig. 2a). Note the
single hydrogen bond between tU32 and tA38.

Extended Data Figure 5 | Structural organization of the interdigitated
T-loop motifs of the T-box stem I. a, Six base-paired layers form the stem I
distal domain core (view as in Fig. 2b). Dashed black lines denote base-base
hydrogen bonding. Dashed yellow lines denote Sr²⁺ ion-water (green and
red spheres, respectively) interactions. Leontis-Westhof symbols” depict
non-canonical pairing. b, Orthogonal view. c, View from the tRNA elbow.
d-i, Detail of layers 1-6. The position of each nucleotide within the pentaloop
(1-5) or outside of it (negative and positive denote 5’ and 3’ directions,
respectively) is in parentheses (Supplementary Discussion).


Extended Data Figure 6 | Comparison of the interdigitated T-loops of
RNase P, the L1 stalk and the T-box. a, b, RNase P (a) and the L1 stalk (b) use
equivalent surfaces formed by two purines, namely nucleotide −2 of T-loop 1
and nucleotide 3 of T-loop 2 (asterisk), to stack on the elbows of pre-tRNA
and the E-site tRNA, respectively. c, By contrast, the T-box uses the opposite
surface, using a triple (Fig. 2c) formed by nucleotide 3 of T-loop 1 (dagger
symbol), nucleotide −2 from T-loop 2, and nucleotide +1 from T-loop 2 to
stack on the tRNA elbow.

Extended Data Figure 7 | Displacement of the tRNA elbow induced by
T-box binding. Although the anticodon stem of the free tRNAPhe (orange)
superimposes well on that of the T-box-bound tRNAGly (T-loop and acceptor
stem in violet; D-loop and ASL in pale green; Fig. 3b, c), the elbow (tG19-tC56)
clashes with the stem I distal domain. T-box binding induces bending about the
t26•t44 hinge (Fig. 3c) that displaces the tRNA elbow and acceptor stem away
from the distal region of stem I by 11 Å (arrows), allowing the tRNA elbow to
associate with the T-box. a, Side view. b, Top view.

Extended Data Figure 8 | Representative electron density maps. Portions of
electron density maps corresponding to the interdigitated stem I T-loops and
the tRNA elbow (oriented like Fig. 2b). a, Unbiased, 3.2-Å-resolution
density-modified SAD electron density (1.0 s.d. above mean peak height) shown as grey
mesh superimposed on the refined model. b, Electron density resulting from
combination of SAD and molecular replacement (using a search model
consisting of two copies each of tRNAPhe and a K-turn in complex with YbxF;
Methods) phases followed by density modification (1.5 s.d.). c, Composite
simulated anneal-omit 2|Fo| − |Fc| electron density calculated using the final
model (2.0 s.d.).

Extended Data Table 1 | Thermodynamic parameters from ITC analysis

T-box*                      tRNAGly   Kd (µM)          log[Ka]   ΔH (kcal mol⁻¹)   −TΔS (kcal mol⁻¹)†
14-182 (wt)‡                wt        0.209 ± 0.025    6.68      -29.9             21.0
14-158 (wt)§                wt        0.194 ± 0.055    6.72      -41.2             32.2
14-113 (wt)||               wt        0.150 ± 0.018    6.82      -25.1             15.9
14-113del_kturn¶            wt        0.097 ± 0.008    7.01      -23.6             14.2
Δ29-30 (Δ42-43)             wt        1.84 ± 0.42      5.74      -27.0             19.3
G43U (G55U)                 wt        1.68 ± 0.16      5.78      -39.7             31.9
C44U (C56U)                 wt        8.20 ± 2.9       5.08      -45.3             38.5
C44A (C56A)                 wt        0.307 ± 0.038    6.51      -30.7             21.9
C44G (C56G)                 wt        3.4 ± 2.3        5.47      -42.1             34.7
A42G/A46G (A54G/A58G)       wt        9.0 ± 2.1        5.05      -30.0             23.2
G56U (G68U)                 wt        4.6 ± 0.44       5.34      -32.0             24.8
G56A (G68A)                 wt        0.406 ± 0.060    6.39      -33.2             24.6
G59U (G71U)                 wt        0.538 ± 0.12     6.27      -45.4             36.9
G59A (G71A)                 wt        1.69 ± 0.30      5.77      -43.0             35.2
A61U (A73U)                 wt        0.125 ± 0.019    6.90      -35.4             26.1
U58G/A62G (U70G/A74G)       wt        12.6 ± 8.3       4.90      -28.2             21.6
G63C (G75C)                 wt        8.7 ± 0.84       5.06      -30.0             23.2
C44G/G63C (C56G/G75C)       wt        1.9 ± 1.5        5.72      -39.8             32.2
C44A/G63A (C56A/G75A)       wt        0.433 ± 0.31     6.36      -32.0             23.5
G87U (G99U)                 tC36A     7.6 ± 4.2        5.12      -17.2             10.4
A90C (A102C)                wt        0.641 ± 0.167    6.19      -26.9             18.6
A90G (A102G)                wt        0.070 ± 0.012    7.15      -31.5             21.9
14-113 (wt)                 tA37U     0.125 ± 0.018    6.90      -35.4             26.1
14-113 (wt)                 ZIGAAA#   0.083 ± 0.016    7.08      -31.8             22.3

* Residue numbers are for O. iheyensis glyQ T-box. Structurally equivalent B. subtilis glyQS T-box numbers are in parentheses.
† All experiments were performed at 293 K.
‡ This experiment corresponds to the titration shown in Fig. 1a.
§ This experiment corresponds to the titration shown in Fig. 1b.
|| This experiment corresponds to the titration shown in Fig. 1c and Extended Data Fig. 2a.
¶ This experiment corresponds to the titration shown in Extended Data Fig. 2b.
# Engineered crystallization construct.
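
The columns of Extended Data Table 1 are linked by two standard relations, log[Ka] = −log10(Kd) and ΔG° = RT ln(Kd) = ΔH + (−TΔS), so individual rows can be cross-checked against each other. The sketch below applies that check to four rows copied from the table; the tolerances it reports simply reflect rounding in the tabulated values, and the function and variable names are ours.

```python
import math

R_KCAL = 1.987204e-3  # gas constant, kcal mol^-1 K^-1
TEMP_K = 293.0        # temperature stated in the table footnote

# (construct, Kd in uM, log Ka, dH, -TdS) copied from Extended Data Table 1.
ROWS = [
    ("14-182 (wt)", 0.209, 6.68, -29.9, 21.0),
    ("14-158 (wt)", 0.194, 6.72, -41.2, 32.2),
    ("14-113 (wt)", 0.150, 6.82, -25.1, 15.9),
    ("A61U", 0.125, 6.90, -35.4, 26.1),
]


def check_row(kd_um, log_ka, dh, minus_tds):
    """Return (dG residual in kcal/mol, log Ka residual) for one table row."""
    kd = kd_um * 1e-6
    dg_from_kd = R_KCAL * TEMP_K * math.log(kd)
    dg_from_terms = dh + minus_tds
    log_ka_from_kd = -math.log10(kd)
    return dg_from_kd - dg_from_terms, log_ka_from_kd - log_ka


residuals = {name: check_row(*values) for name, *values in ROWS}
for name, (dg_res, log_res) in residuals.items():
    print(f"{name}: dG residual {dg_res:+.2f} kcal/mol, log Ka residual {log_res:+.3f}")
```

The same check is a convenient way to catch transcription errors when table values are re-entered by hand.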
Extended Data Table 2 | Data collection and refinement statistics

                       Crystal I                  Crystal II                Crystal III               Crystals I-II-III
Data collection
Space group            C222₁                      C222₁                     C222₁                     C222₁
Cell dimensions
  a, b, c (Å)          100.3, 108.4, 266.8        99.1, 107.5, 266.1        100.3, 108.3, 267.8       100.0, 108.1, 266.9
  α, β, γ (°)          90, 90, 90                 90, 90, 90                90, 90, 90                90, 90, 90
Resolution (Å)         28.42-3.20 (3.31-3.20)*    133.1-3.87 (4.08-3.87)    133.9-3.76 (3.97-3.76)    134-3.43 (3.61-3.43)
Rmerge (%)             6.2 (104.8)                6.2 (67.1)                6.7 (59.5)                10.8 (82.8)
<I>/<σ(I)>             11.1 (1.5)                 14.3 (2.2)                14.2 (2.3)                23.7 (2.3)
Completeness (%)       98.9 (98.8)                99.8 (99.7)               99.4 (99.2)               99.3 (96.0)
Redundancy             6.1 (6.2)                  4.8 (4.8)                 4.8 (4.7)                 12.8 (6.2)

Refinement
Resolution (Å)         28.42-3.20 (3.31-3.20)
No. reflections        24325 (2369)
Rwork/Rfree (%)        19.6 (33.6) / 25.2 (38.2)
No. atoms              8742
  RNA                  7586
  Protein              1048
  Ion                  86
  Water                22
Mean B-factors (Å²)    128.0
  RNA                  118.7
  Protein              192.7
  Ligand/ion           176.4
  Water                64.2
R.m.s. deviations
  Bond lengths (Å)     0.001
  Bond angles (°)      0.37

One crystal was used for each of the three data sets.
* Highest resolution shell is shown in parentheses.

CAREERS




A macrophage white blood cell (centre) engulfs and destroys bacteria (orange) and spews out the remnants. 


Industrial immunology 


In a tough job market, immunologists are in demand. To move from academia to a
biotechnology or drug company, researchers must explore the options and stay flexible. 


BY ALLA KATSNELSON 


Olivia Schneider realized early in her
graduate work on immune-cell signalling
that she had no interest in becoming
an academic researcher. “I didn’t want to
work in a lab, or to write grants,” she says. In 
2009, when she finished her PhD at the Uni- 
versity of Cincinnati in Ohio, the global reces- 
sion was in full swing and employment options 
looked scarce. Her husband had a well-paid job 
in the area, so relocating was impractical. Then 
Schneider saw an advert for part-time work 
through a local contract-research organization, 
doing tissue culture and cloning for a
recombinant-protein manufacturer called Shenandoah
Biotechnology in Warwick, Pennsylvania. “I
just wanted to get my foot into biotech in some 
regard,” she says. “I took this position — that I 
was way overqualified for — with the hope that 
it would turn into something else.”

And it did. The project involved using 
antibody-based kits to test the quality of 
Shenandoah’s products. Flying back and forth 
to give talks on progress, Schneider developed 
a rapport with Shenandoah’s owners. When the 
project ended unsuccessfully, they invited her to 
join their firm, working remotely as an associate 
director of research and development. Fifteen 
months later, she became chief scientific officer. 

Schneider now brainstorms new products 
on the basis of feedback from researchers. She
manages quality control, represents the company
at trade shows and conferences, trains
sales personnel and even tackles marketing and 
customer relations. The variety suits her. “I try 
to tell people, try to pick out what you liked in 
graduate school. There is a job associated with 
that,” she says. As a student, Schneider had loved 
giving presentations and talking to people about 
their own work and science in general, but had 
little interest in actually analysing data. “I kind 
of created a job that reflects that,” she says.

Industrial opportunities for early-career 
scientists have decreased in the wake of the 
recession, but in some quarters, immunology 
has done better — especially when compared 
to fields such as neuroscience. Immunology
and oncology are, for example, the most
active fields for recruitment at the executive 
level, says Kenneth Israel, founder of Search 
Group of San Diego, a recruitment firm in Cali- 
fornia that focuses on executive-level positions 
in the life sciences. 

The interest may be fuelled by companies 
increasing investment in biological drugs 
such as antibodies, says Ravi Chari, execu- 
tive director of chemistry and biochemistry 
at ImmunoGen, in Waltham, Massachusetts, 
which develops antibodies for treating can- 
cer. “Even traditional pharma companies like 
Merck and Eli Lilly are now getting into
immunology,” he says. Jonathan Deane, a researcher
at the Genomics Institute of the Novartis 
Research Foundation in San Diego, notes that 
several immunomodulating drugs have been 
approved in the United States in recent years, 
and there has been a stream of positive news 
about immune-based cancer treatments. Still, 
the job market is affected by broader industry 
forces, he warns. “Companies have their budg- 
ets — that’s just global,” he says, noting that 
many are making cuts. 


CLEAR GOALS 

In the current climate, immunologists need to 
be clear about the kind of job they are look- 
ing for. One niche is research positions, either 
at drug-development companies or at firms 
developing reagents and research materials 
for use by academic or industry scientists. But 
there are options beyond the bench, too — 
such as medical-liaison officers, who cultivate 
contacts in a particular therapeutic field to cre- 
ate awareness of a product or company. 

For research positions, PhD holders can expect
annual starting salaries ranging from
€45,000 (US$59,000) to €65,000 in Europe,
says David Marimuthu, senior recruitment
consultant at SIRE Life Sciences, a staffing
company based in Amsterdam. However, he adds,
someone with a few years of industry experience
or specialized knowledge in a niche area could
get double that, or even more with management
experience. Starting research salaries are
closer to about US$80,000 in the United States,
says Phil Mericantante, co-owner of Adante
Staffing in Woburn, Massachusetts, which
specializes in the life sciences. Medical-liaison
officers might start at between $85,000 and
$100,000 depending on the size of the company,
and regulatory positions bring in something
like $50,000-60,000, says Mericantante.

Recruiters do not tend to work with
entry-level applicants, because companies are
not generally interested in paying finders’ fees
for such jobs, says Marimuthu. But it is a
different story for people with highly specialized
research backgrounds. “Companies developing
one product, if they find a real genius in
their field, they are willing to pay,” he says, and
recruiters may be useful.

Ravi Chari says the best applicants have explored
several areas of research.

As with most career transitions, networking 
is a tried and true way to learn about the types 
of opportunities available and to land a job. Job 
seekers should do more than just show up to 
events, advises Schneider. They should seek out 
people from interesting companies at confer- 
ences, and mine their university career centre's 
alumni list for contacts. Applicants can request 
informational interviews to find out about the 
company, and then ask who else they should 
speak to. “Once you know you want to go after 
a particular career, you really have to do your 
homework,” says Michael Crowley, director of
business development at the biotechnology 
firm Genentech in South San Francisco, Cali- 
fornia. “You have to know the company, and 
you have to bring really good energy to the 
interviews. If interviewers are not sensing the 
investment, they will just write you off.”

As an undergraduate, Crowley was “devoutly
focused on immunology” but, like Schneider, he 
realized during his doctorate that he did not pic- 
ture himself running an academic lab. In 1999, 
a year after finishing his PhD on immune cells 
called γδ T cells at Stanford University in
California, he fell into a consulting job at the
now-defunct Cellegy Pharmaceuticals. He loved the
operations side of drug development, so he 
studied for a business degree, and Genentech 
hired him in 2003, after a summer internship. 
Now he leads a three-person team that seeks 
out research collaborations and licensing deals 
relating to immunology and infectious diseases. 

Business-development positions differ
depending on the size of the company, Crowley 
notes. At Genentech, he focuses on what he 
calls the “buy-side” — bringing in intellectual 
property on the basis of a wish list provided by 
company researchers. For example, in February, 
his group signed a deal with a San Diego start- 
up that had found an innovative way to pursue 
a target of interest. “Small companies are the 
sellers, and that’s a different kind of job,” he says.

Crowley adds that he does not feel as if he 
abandoned the work he loves to go into busi- 
ness. “I’m still very close to the science here,” he 
says. “But it’s also about communication and 
relationships, which I felt I couldn't get at the 
bench.” Job seekers should decide from the start 
whether they want to remain in the lab or forge 
a non-research career, he says. “If you’re
ambivalent, it will hold you back in both realms.”

Those who do want to stay at the bench 
should attempt to broaden out beyond a single 
field of expertise, says Chari. He has a PhD in 
organic chemistry but did his postdoctoral work 
in protein chemistry, enzymology and related 
fields. His broad background helped him to get 
hired at ImmunoGen in the late 1980s. 


DIFFERENT STROKES 

Company cultures take some getting used to. 
Probably the biggest difference from academia 
is team involvement, says Deane. Developing 
a drug involves multiple teams working in 
concert, he says: one group might be doing 
the immunology, but at the same time a sec- 
ond group is looking at protein expression, a 
third at X-ray crystallography and a fourth at 
pharmacodynamics. Each group must stay in 
close communication with the others, and with 
managers guiding the project. 

Then there is the issue of publishing research. 
Some companies encourage it, whereas others 
prohibit it. Kerry Casey, a research scientist at 
MedImmune in Gaithersburg, Maryland, the 
biological-drugs research and development arm 
of AstraZeneca, feels fortunate to be at a com- 
pany that promotes publishing. In her two years 
there, she has worked on a high-priority project 
in respiratory inflammation and immunity, and 
she currently has two manuscripts in the works. 
MedImmune is close to the US National Insti- 
tutes of Health (NIH) in Bethesda, Maryland, so 
Casey can easily pop over for conferences. Many 
of her co-workers once did research at the NIH, 
and still have colleagues there. “We are really 
linked into the academic community,” she says.

Most industrial researchers work under 
some corporate limitations, however. “As a
university professor, you are your own small
business and your own chief executive,” says Lewis
Lanier, a microbiologist and immunologist at 
the University of California, San Francisco. 
“Companies are dictatorships — unless you're 
the chief executive, you're not in charge.”


Alla Katsnelson is a freelance writer in 
Northampton, Massachusetts. 
Still a scientist


It is not easy to let go of a scientific identity after leaving
the lab — nor is it necessary, says Chandrika Nair. 


If I choose a career outside the lab, will I
still be a scientist? That question often
occupies my thoughts as I consider my
options post-PhD.

Imagining a future outside research is
not easy: I have become accustomed to this
environment. But it makes sense, given
that more than half of current science PhD
students will leave the lab and spread their
wings elsewhere (see Nature 475, 533-535;
2011). Perhaps a science-related job could be
a workable compromise.

Scientific research relies on a large network
of support, from teachers to research councils
and charities. There are also editors, journalists
and communicators, policy-makers
and campaigners. A job in any of these areas
could tick the science box: keeping up with
research trends as a journalist, for example,
or reviewing evidence as a policy-maker.

But would I want to be the person facilitating
science rather than doing it myself? Should
I exchange my lab coat for a suit? My primary
motivation for considering other career paths
is that experimental work can be lonely. As I
sit pipetting at a fume hood or staring at cells
in a darkened room, I admit, I crave human
interaction. Still, academic research has many
advantages, and the one I treasure most is
freedom. Many researchers have flexible schedules
and intellectual independence. And the idea of
working on basic ‘blue skies’ research appeals
to me. Few other jobs pay their employees to
indulge their child-like curiosity, to pursue
projects that may or may not succeed.

If I do leave the bench, will I forfeit my
identity as a scientist? Can a person be a scientist
and not work in a lab? The scientist in me
decided to address this deep, dark question
by gathering some data. I surveyed nine people
working in science-related fields, including
a medical charity, a learned society and a
research-funding body. The results? Although
they no longer consider themselves scientists,
all respondents said that their science
background has stayed with them. That knowledge
and training are relevant to their job on a scale
varying from ‘occasionally’ to ‘routinely’.

But what interested me most was the
consensus that it is important to have a scientific
outlook. The people I asked said that their
training affects how they approach problems
at work, encouraging them to seek evidence
and test ideas. They have exported aspects
of the scientific method to real-life settings.
One person said that they typically question 
and research sources of information more 
than their colleagues who do not have scien- 
tific backgrounds. 

I do not know what career I will be pursu- 
ing in several years’ time. One highly desir- 
able scenario would be presenting exciting 
findings at a conference with several high- 
impact papers under my belt, maybe a grant 
or two, and a permanent position within 
reach. However, I am coming to the pleasant
realization that I could easily find fulfilment 
in other occupations, especially if I feel that I 
am putting my scientific training to good use. 

My very preliminary data suggest that people
who have left research might not fit the
conventional definition of a scientist, but still
retain the identity of scientists by training. 
As someone who often wishes that impor- 
tant aspects of our lives — from advertis- 
ing to health care and politics — were more 
evidence-based, I welcome the possibility of 
science-literate people having diverse roles 
in society. Perhaps I will even still be able to 
call myself “a scientist”.


Chandrika Nair is a PhD student in 
microbiology at Imperial College London. 


CORRECTION 

The Careers Feature ‘Middle Eastern 
promise’ (Nature 500, 111-112) wrongly 
stated that the New York University Abu 
Dhabi offers master’s-level courses. It 
does not. 
HIRING
US job ads increase 


There were 6% more advertisements for 
US science-related positions posted online 
in June 2013 than in June 2012, following a 
22% increase from 2011 to 2012, according 
to an analysis of 25,000 recruitment 
sources including LinkedIn and 
CareerBuilder. The study, released on
31 July by Wanted Analytics in Quebec
City, Canada, also found that the number 
of employers advertising jobs was up 
slightly in 2013 after a 15.8% increase from 
2011 to 2012. The number of academic 
posts advertised, however, was down 
16.5% from 2012. The biggest increases
in recruitment were in San Francisco and
Los Angeles, California. In both areas, 
demand rose by almost one-fifth, owing in 
part to increased hiring at the University 
of Southern California and at biomedical 
research firms including Genentech. 


DATA-SHARING 
Private beats public 


Researchers in some fields are more likely 
to share data in response to personal 
requests than by routinely submitting to 
public repositories, says a study (J. C. Wallis 
et al. PLoS ONE 8, e67332; 2013). The 
authors conducted 43 interviews with 
researchers who use wireless sensing, 
including biologists and seismologists;
in more than two-thirds, participants
did not report submitting to repositories,
sometimes because a suitable one did not 
exist or because they wanted to know how 
data would be used. Co-author Jillian 
Wallis, an information-studies researcher 
at the University of California, Los Angeles, 
emphasizes that using repositories makes 
data easier to find and increases citations. 


ACADEMIA 
US health premiums rise 


More than one-quarter of US academic 
institutions said in a survey that they had 
raised the costs of health-care premiums 
for faculty members and staff this year in 
preparation for the Affordable Care Act, 
which is set to ramp up in 2014. The 2013 
Employee Health Benefits Survey of Higher 
Education by the College and University 
Professional Association for Human 
Resources in Knoxville, Tennessee, found 
that premiums rose by an average of 
3.25% for employee-only plans and 5% 
for employee-family plans. Two-thirds of 
respondents said that they are still trying 
to determine which adjunct or part-time 
faculty members are eligible for benefits. 
TIME HEALS ALL WOUNDS


BY GRACE TANG 


Hannah’s hair glowed amber in
the porch light as I watched her
through the semicircular pane of
frosted glass. In the next second, the front 
door would click shut, cutting her off from 
my life. I willed myself to rise, to go after her 
and stop her, but movement was something 
that belonged to the next second, and the 
next second would not come. 

Time had, as expected in my condition, 
slowed dramatically, and in this state I 
observed the world in excruciating detail: 
the scraps of ‘meat’ that had fallen from our 
TV-dinner trays were spreading pools of oil 
on the blue paper of the unpaid hospital bills. 
The unkempt pile lay strewn across the cof- 
fee table, and in the centre of the heap lay a 
half-carat diamond ring, the band scratched 
and faded from years of wear. 

Two decades ago, on the last day that time 
still flowed at a regular pace regardless of my 
emotional state, the ring had been new and 
unblemished, sitting inside a ring box on my 
doctor’s desk. 

“I’m going to return it today,” I choked,
my forehead resting on my palm, trying 
desperately to hide my emotions, “it’s not 
fair to Hannah.” 

Doctor Kwon sat opposite me in silence. I 
knew he wanted to tell me that he could up 
the dose of the medication again, and that 
the voices and hallucinations would stop, 
allowing me to lead a normal life. I knew 
he wanted to say that my disease would not 
gradually chip away at my personality till all 
that was left of me was a raving lunatic who 
only externally resembled my former self. 

But all Doctor Kwon could do was put his 
hand on my shaking one. He hesitated. 

“Dan, there is something that just may 
work.”

I looked up, hopeful. 

“Now don’t get too excited. It’s an
experimental procedure, a brain surgery, never
tried before in humans. A team in this hospi- 
tal was going to start looking for volunteers.” 

I snatched the ring off the table and 
pushed it back into my jacket pocket. 

“Sign me up.” 

At first it seemed as though my hallucinations
had stopped after the surgery, but
perhaps it was just that the nature of my
hallucinations had changed. My first
hot bath since the
operation was glorious. I soaked in the water
for what felt like hours, until I was worried
Hannah would start to wonder if I had
passed out. Hannah eyed me with concern
when I emerged, dripping.

Seconds out.

“Something wrong, dear? Did it hurt?” 
She gingerly caressed my head near the site 
of the surgery. 

“No, that felt great.” 

“Then why did you come out right away?” 

All too soon I was back in Doctor Kwon’s 
office. The team that had operated on me 
stood by, fascinated. He held a stopwatch 
in one hand and offered me a piece of gum 
from a bowl in the other. Under his instruc- 
tions, I put the gum in my mouth. I enjoyed 
the mint flavour immensely, but he remained 
silent, watching me eat until it became awk- 
ward to continue. 

“Uh, should I keep going?” 

He clicked the stopwatch off. 

“How long have you been eating?” 

“I don't know, five minutes?” 

“Fascinating...” 

“What?” 

Out of nowhere, he hit the side of my arm. 
It stung. 

“Ow! What was that for?” 

Again, he stood there dumbly for the long- 
est time before responding. 
“Sorry. How long did the pain last when
I hit you?”

“Three minutes?” 

“It seems that a side effect of the surgery is
that emotional responses slow your percep- 
tion of time. I gave you that piece of gum no 
more than two minutes ago...” 

It was trivial in the beginning — seconds 
could stretch into minutes — but my condi- 
tion gradually got worse. After a year, a few 
seconds could drag like hours when I got 
emotional. It got to the point where I could no 
longer tell time enough to function. I lost my 
job, and my main source of income was the 
chump change time-perception researchers 
threw at me when they needed a guinea pig. 

Tonight, Hannah had come back from her 
second job exhausted. 

“Dan,” she sighed, putting her face in her
hands. 

I could sense the distress in her voice, 
which alone was enough to trigger enough 
of an emotional response that time began to 
slow. 

“Yes ... deaaar?” 

Like a drunk at a party, vaguely aware of 
my slurred speech, I struggled to speak at a 
normal rate, unsure if I was succeeding. 

She inhaled sharply, her voice cracking. 

“I’m sorry...”

She took off her ring for the first time in 
20 years. Time crawled slower than it ever 
had, nearly still. Was my condition deterio- 
rating, or had I never felt such a strong emo- 
tion in my life? The diamond sparkled like 
a tiny glacier as it drifted through the ocean 
of space between her hand and the top of the 
coffee table. 

Now the halo-like rays from Hannah’s 
blonde crown faded like the setting sun as 
she descended the front stairs of the porch. 

The wall clock ticked, deafening. With 
the arrival of the new second, the door 
clicked into place, but I had not moved an 
inch. Hannah disappeared from view. My 
eyelids fell, the first stage of the 300-milli- 
second sequence of an eye blink. The flow 
of air from my lungs slowed as I reached the 
nadir of my breath. If the Universe would 
progress another hundred milliseconds, my 
eyes would open again, and I would take my 
next breath. I waited and waited, but neither 
happened. I knew, like they said, that time 
would heal this wound, but time had ceased 
to flow.